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- Extensions of time may be available under the provisions of 37 CFR 1 .1 36(a). In no event, however, may a reply be timely filed 
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5) D Claim(s) is/are allowed. 
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Art Unit: 1647 

DETAILED ACTION 

The preliminary amendments filed 1 1/17/2003 and 07/1 1/2001 have been entered. 



Claims 6-8, 1 1, 12 are pending. 

5 

Applicant's election of group I, claims 6-8, 12, in Paper No./the paper filed 
08/22/2003 is acknowledged. Because applicant did not distinctly and specifically point 
out the supposed errors in the restriction requirement, the election has been treated as an 
election without traverse (MPEP § 818.03(a)). 

10 

Applicant's election of the polypeptide encoded by SEQ ID NO: 10 or comprising 
the amino acid sequence of SEQ ID NO: 9 species in Paper No./the paper filed 
12/08/2003 is acknowledged. 

1 5 Claim 1 1 is withdrawn from further consideration pursuant to 37 CFR 1 . 142(b) as 

being drawn to a nonelected invention, there being no allowable generic or linking claim. 
Election was made without traverse in Paper No./the paper filed 08/22/2003. 



Claims 6-8, 12 are being examined. Claim 12 is being examined only to the 
extent that it reads upon the polypeptide encoded by SEQ ID NO: 10 or comprising the 
amino acid sequence of SEQ ID NO: 9 species. 
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Priority 

Applicant has not complied with one or more conditions for receiving the benefit 
of an earlier filing date under 35 U.S.C. 120 as follows: 

An application in which the benefits of an earlier application are desired must 
5 contain a specific reference to the prior application(s) in the first sentence of the 

specification or in an application data sheet (37 CFR 1.78(a)(2) and (a)(5)). The specific 
reference to any prior nonprovisional application must include the relationship (i.e., 
continuation, divisional, or continuation-in-part) between the applications except when 
the reference is to a prior application of a CPA assigned the same application number. 

10 It is acknowledged that the present application contains a specific reference to the 

08/874,474 prior application in the first sentence of the specification. However, the 
specific reference to the 08/874,474 prior nonprovisional application does not include the 
relationship (i.e., continuation, divisional, or continuation-in-part) between the 
applications. The status of nonprovisional parent apphcation(s) (whether patented or 

1 5 abandoned) should also be included. If a parent application has become a patent, the 

expression "now Patent No. " should follow the filing date of the parent 

application. If a parent application has become abandoned, the expression "now 
abandoned" should follow the filing date of the parent application. 

If a benefit claim to a provisional application is submitted without an indication 

20 that an intermediate application directly claims the benefit of the provisional application 
and the instant nonprovisional application is not filed within the 12 month period or the 
relationship between each nonprovisional application is not indicated, the Office will not 
recognize such benefit claim and will not include the benefit claim on the filing receipt. 
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Therefore, a petition under 37 CFR 1.78(a) and the surcharge set forth in 37 CFR 1.1 7(t) 
will be required if the intermediate application and the relationship of each 
nonprovisional application are not indicated within the period set forth in 37 CFR 
1 78(a). Even if the Office has recognized a benefit claim by entering it into the Office's 
5 database and including it on applicant's filing receipt, the benefit claim is not a proper 
benefit claim under 35 U.S.C. 1 19(e) or 35 U.S.C 120 and 37 CFR 1.78 unless the 
reference is included in an ADS or in the first sentence of the specification and all other 
requirements are met. Accordingly, the benefit of the filing dates of the 08/874,474 
nonprovisional application and the 60/020,150 provisional application is denied. 
10 It is acknowledged that Applicants submitted a petition on 1 1/17/2003 to accept 

an unintentionally delayed claim for priority. However, that petition has been dismissed. 
See the paper mailed 05/21/2004. 



Claim Rejections - 35 VSC§102 
1 5 The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 

form the basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
20 States. 

Claims 6-8, 12 are rejected under 35 U.S.C. 102(b) as being anticipated by De 
Robertis (N). 



10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 



Application/Control Number: 09/903,188 
Art Unit: 1647 

This rejection is being made because the Office does not recognize Applicants 
benefit claims to the 08/874,474 nonprovisional application and the 60/020,150 
provisional application, as discussed above. 

De Robertis discloses a substantially pure protein characterized by a 
physiologically active form and comprising an amino acid sequence encoded by the DNA 
of SEQ ID NO: 10 (page 25, claim 6). De Robertis's SEQ ID NO: 10 is identical to the 
present application's SEQ ID NO: 10, as indicated below (Qy = the present application's 
SEQ ID NO: 10) (Db = De Robertis's SEQ ID NO: 10): 

AAV14017 

ID AAV14017 standard; CDNA; 1B93 BP. 
XX 

AC AAV14017; 
XX 

DT 09-JUL-1998 {first entry) 
XX 

DE Human "frazzled" frzb-1 cDNA. 
XX 

KW Growth factor; frazzled; frzb-1; Wilts antagonist; human; 

KW tumour suppressor; cancer; ds. 

XX 

OS Homo - sap i ene . 
XX 

FH Key Location/Qualifiers 

FT CDS 61.. 1038 

FT /*tag= a 

FT /product= f rzb-l_protein 

XX 

PN W09748275-A1. 
XX 

PD 24 -DEC- 1997. 
XX 

PF 19-JUN-1997; 97WO-US10942 . 
XX 

PR 18-JUN-1997; 97US-087B474 . 
PR 20-JUN-1996; 96US-00201 50 . 
XX 

PA (REGC ) UN IV CALIFORNIA. 
XX 

PI Bouwmeester T, De Robertis EM; 
XX 

DR WPI; 1998-062760/06. 
DR P-PSDB; AAW41254. 
XX 

PT New isolated growth factors - with neurotrophic, growth or 

PT differentiation factor activity, tumour growth suppressor activity 

PT or mesoderm differentiation activity 

XX 

PS Claim 6; Fig 10; 48pp; English. 
XX 

CC The present sequence encodes Che human growth factor protein 

CC "frazzled" frzb-1. frzb-1 is an antagonist of Wnts in vivo, and 

CC thus is believed to find utility as a tumour suppressor gene, 

CC since overexpressed Wnt proteins cause cancer. Frzb-1 may also be a 

CC useful vehicle for solubi 1 isation and therapeutic delivery of 

CC completed Wnt proteins. 

XX 

SQ Sequence 1893 BP; 516 A; 438 C; 432 G; 507 T; 0 Other; 

Query Match 100.0%; Score 1893; DB 19; Length 1893; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1893; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 GGCGGAGCCH3GCCTTTTGG CCTCCACrrcCG 60 

11 II I IIMII 1 1 III 1 1 1 1 III! II II IIMMimi lit MM III II III I III I! 

Db 1 GGCGGAGCGGGCCnTTTGGCOTCC^CTGCX3CGGCTGCACC 60 



Qy 



61 ATtXjTCTGCGGC^GCCCGGGAQGGATGCTGCTGCTGCG 120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 f 1 1 r 1 1 1 1 1 r j i r r r 1 1 1 1 1 
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Db 


61 




Qy 




5 


Db 


121 




Qy 




10 


Db 


iei 




Qy 


24 1 




Db 




15 


Qy 


301 








20 


Qy 


361 




Db 






Qy 


421 


25 


Db 


421 




Qy 


461 


30 


Db 


461 




Qy 


541 




Db 




35 


Qy 


601 








40 


Qy 


661 




Db 






oy 


721 


45 


Db 


721 




oy • 


761 


50 


Db 


761 




Qy 


84 1 








55 


Qy 


901 




Db 


901 


60 


oy 


961 




Db 


961 




Qy 


1021 


65 


Db 


1021 




Qy 


1061 


70 


Db 


1061 




Qy 


1141 




Db 


1141 


75 


Qy 


1201 




Db 


1201 


80 


oy 


1261 




Db 


1261 




Qy 


1321 


85 


Db 


1321 




Qy 


1361 


90 


Db 


1361 




Qy 


1441 




Db 


1441 



61 ATOGTCT(KX3GCAGCCCGGGSU3GaATGC^ 120 

GCTCTCTOCXrroCTCCGGGTGCCCGGGG^ ISO 

lllllllllllllllllllllillltlMllllltlllMlllllllfllllllllllll 

GCTCTCTGCCTGCTCCGGCnXXXXXX3aa 180 

CCCCTtjIXjCAAOTCCCTCCCCTGGAAC 240 

IIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIMIIIIM 

CCCCTGTOCAAGTCCCTGCCCTGGAACATGftCT 240 

ACTCAGGCCAACGCCATCCTGGCC»TC^ 300 

llllllllllllllltllllllllllllllllllllllllllllfllllllllllllMI 

ACTCAGGCCAACGCCATCCTX3GCCATCXIAGC 300 

AGCCCCGATCTGCTCTTCTTCCTCTGT^ 360 

illlllllllllllllllllllMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIHM 

AjGCCCCX^TCTGCTCTTCTTCCTCTIXrrc 360 

CAQCACEACKTCCATCAAGCCCT^ 420 

lllllllllllllllllllllllllllllllllllllllllllllllllllllllllllt 

CAGCACGAGCCCAT<^U\GCCCTGTAAOT 420 

CCCATACTCATCAAaTACCX3CXACTOGTGGCCX3QAaAACCTGGCCTGC^ 480 

MlllllllllllllltlllltlltlllllllllllllllllllllllllllMI 

CCCATACTCATCAAGTACXXICCACTCGTGGCaX^ 480 

GTGTACGACAGGGGCGTGTGCATCTCTCCCGAGGCCTV^ 540 

IIIIIIHIIIIIflllllllMllllillllllllllllllllllllllllllHIIII 

GTOTACGACAGGGGCGTGTGCATCTCrCCas^^ 540 

TTTCCTATGGATTCTAGTAACGaAAACrGT^ 600 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIII 

TTTCCTATGGATTCTAGTAAOKjAAACIKTn^^ 600 

AAGCCTATTAGACKTTACACAGAWiACCTATTTCC 660 

iiiiiiiiiiiiiiiiiiiiitiiiiiiiiiiiiiiiiiiiiiiiniiiiiiiiiMii 

AAGCCTATTAGAOCTAOVCAGAAGACCTATTTCOGGAACAATTACAACTATC 660 

GCTAAAGTTAAAOAGATAAAGACTAAGTGCCATaATGT^ 720 

I llllllll III I till IIIIMIM IjINIMIIII MIIIMI M II IIIIIMIM 

G CTAAAGTTAAAGAGATAAAGACTAAGTG CC^TGATOTGA 720 

GAGATTCTAAAGTCCTCTCTGGTAAACATTCCACGGGACACTGTCA^ 780 

IIIIIMMIIMIIIIIMIIIIIIIIMIIIIIIIMIIIIIIIIIMIMIilllll 



j|j||IMIIIIIIIIIIIIIIIII!lllllllllltllllllllflMII!llll!ll 

ATGTTAATGAGGAATATATCATCATGGGCTATGAA 840 



iiiiiiiu iii i iii i ii i Mill i iiMini i iii i in ii ii ii ii iiiiiiiii i 

841 OATGAGOAACCTTCCAGAT 



IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIII 

rGACTCGGTAAAAAAGTTAAG CGCTGGGATATG^GCTTCGTCATCTTGGACTCAGTAA/ 
^GTGATTCTAGCAATAGTGATTCCACTCAGAGTCAGAAGTCTGG CAGGAACTCGAACCCC 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiniiiiiiiiiiiiiiiiiiiiiii 



llilllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

CX3GCAAGCACXK^VACTAAATCCOGAAATACAAAAAGTAACACAGTGGACTTCCT 1080 



llllllllllllllllllllllllllllllllllllllllllllllllllllllillll 



IIIIIMIIIIIIIIII IIMIIIIIIIIIIIIIIIIIIIIIIIIMIIIMMIIMII 

TCTGC 1200 



llllllillllllllllllllllllllllllllllllllllllilllllllllllllll 



IIIIIIIIIMM Nil IIIIIIIII lllllillllll MIIIMI III! IIIHIMII 

GTTTTCTATTTaVCTAATCATGAaAAAAACIVlM'ClMUUGCAATAATAATAAATTAAA^ 

TGCTGTTACCAGAGCCTCTTTGCTGAGTCTCCAGATGTTAATTTACTT^ 

I I I I I I I I I t 1 I 1 I I I I I I 1 I I I I I I I 1 I I I I I I 1 1 I I I 1 I 1 I I I I I I I I 1 1 I I I 1 I I f I 
TGCTGTTACCAGAGCCTCTTTGCTGAGTCTCCAGAT^ 

TTGGGAATG C^TATTGGATGAAAAGAQAGGTTTCTGGTATTCACAGAAAG CTAGATATG 

i iiiiiiii iii i n 1 1 iiinn i nil n iiiimi immm 1 1 ii niiinn i 

TTGGQAATG GAATATTGGATGAAAAGAGAGGTTT CTGGT ATTCA CAGAAAG CTAGATATG 
CCTTAAAACATACrrCTGCCGATCTAATTACAGCCTTATTTTTGTATC 

I II llllll III Ml I M IMIII 1 1 III II II Mil 1 1 III Mill I II IIMI MM I 
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Qy 1501 CTCCTCAlTXTrTAGBUVAJQTTCCAAATGTTTO 1560 

II Mill Mill 1 1! II Hill! Iltll IIIIII!! Illll llllllllii llllll 1 1 1 

Db 1501 CTCCTtlATOCTTAQAAAOTTCCAAATGTTTATAAAGQTA 1560 

Qy 1561 TUTCACATAGGCAAAGCAATCAADCIACCAGGAACn^ 1620 

II Mill Illll I II II IMMMIMI IMIMIMI lllllll III llllll I Mill 

Db 1561 TCTTCAC^TAGGaiAAGCAATaU^ 1620 

Qy 1621 TGAATTATTTTTTW1ACTGTC2U3GAAGTAAAATA 1680 

II MIMMMI I II 1 1 MM lllllll IMIIMIIMM MIMMMIIIIMIMI 

Db 1621 TGAATTATTTTTOAGACTGTCAGGAAGTAAAATAAATAGGAJGCTTA^^ 1680 



15 1681 T^mTmT^ 1740 

Db 1681 OCCTGATTGAOAAGCACAACTGAAACCAGTAG^ 1740 



Qy 1741 CTTTTGGCAATACATTTQATTTGTTCATGAATATATTAATCAI3 1800 

IIMMI1IIII! IIIMIIMIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIMIMI 

Db 1741 CTITTTGGCAATACATTTaATTTGTTCATCAATATATTAATCAG 1600 

Qy 1801 ATAACTAGACATCTGCTGTTATCACCATAGTTTTGTTTAATTTC I860 

1 1 1 1 1 1 1 1 1 M I M 1 1 M M 1 1 1 1 1 1 1 1 I MINI I 1 II I M I II II I M II 1 1 M 1 1 M 

Db 1801 ATAACTAGACATCTGCTGTTATCAC CATAGTTTTGTTTAATTTGCTTCCTTTTAAATAAA 1860 

Qy 1861 CCCATTGGTQAAAGTCAAAAAAAAAAAAAAAAA 1893 

1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 m 1 1 1 1 1 1 1 1 1 1 1 m i f i 

Db 1861 CCCATTGGTGAAAOTCAAAAAAAAAAAAAAAAA 1893. 



30 De Robertis' s SEQ ID NO: 10 encodes the amino acid sequence of SEQ ID NO: 9 

and SEQ ID NO: 9 is the amino acid sequence of human frzb-1 (page 6, lines 29-31; 
Figures 9 and 10). De Robertis' s SEQ ID NO: 9 is identical to the present application's 
SEQ ID NO: 9, as indicated below (Qy = the present application^ SEQ ID NO: 9) (Db = 
De Robertis's SEQ ID NO: 9): 



ID AAW41254 standard; protein; 325 AA. 
XX 

AC AAW41254; 
XX 

DT 09-JUL-1998 (first entry) 
XX 

DE Human "fraz2led" frzb-1. 
XX 

KW Growth factor; frazzled; frzb-1; Wnts antagonist; human; 

KW tumour suppressor; cancer. 

XX 

OS Homo sapiens. 
XX 

PN W0974B275-A1 . 
XX 

PD 24-DEC-1997. 
XX 

PF 19-JUN-1997; 97WO-US01 0942 . 
XX 

PR 20-JUN-1996; 96US-0020150P. 
PR 18-JUN-1997; 97US- 00878474 . 
XX 

PA {REGC } UN IV CALIFORNIA. 
XX 

PI De Robertis EM, Bouwmeester T; 
XX 

DR WPI; 1998-062760/06. 
DR N-PSDB; AAV14017. 
XX 

PT New isolated growth factors - with neurotrophic, growth or 

PT differentiation factor activity, tumour growth suppressor activity or 

PT mesoderm differentiation activity. 

XX 

PS Claim 6; Fig 9; 48pp; English. 
XX 

CC The present sequence is the human growth factor protein "frazzled" frzb- 
cc l. frzb-l is an antagonist of Wnts in vivo, and thus is believed to find 
CC utility as a tumour suppressor gene, since overexpressed Wnt proteins 
CC cause cancer. Frzb-1 may also be a useful vehicle for solubi 1 isation and 
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5 



15 
20 



CC therapeutic delivery of complexed Wnt proteins 
XX 

SQ Sequence 325 AA; 

Query Match 100.0%; Score 1738; DB 2; Length 325; 

Best Local Similarity 100.0%; Pred. No. 7.1e-16S; 

Matches 325; Conservative 0; Mismatches 0; Indels 0; Gaps 



A rt Qy 1 MVCGS PGGMLI^JUU3L1JUAALCLIJ?VPGARAAACEPVR I PLCKS LPWNNTTKMPNHLHHS 60 

10 iiiiiiiiiiiiiiiiiiiiiiiililtiitllilillilllllllllllllliiiilli 

Db 1 MVCGSPGGMLLUlAGLLALJUUiCLLRVPCUUlAAAOT 60 

Qy 61 TQANAIIAIKQFEGLLOTHCSPDIJJFLCAMYAPICTIDFQHKPI^ 120 

lllllllllllllllllllllllllftltllllllllllllllllllHIIIIIIIMII 
Db 61 TQANAI LAI EQFEGLLOTHCSPDLLFFLCAMYAP I CT I DFQHEP I KPCKSVCBRARQGCE 120 

Qy 121 P I L I KYRHSWPKNXACEE LPVYDRGVC I S PEA I VTADGAD FPMDS SNGNCROAS SERCKC 180 

IIIIIIIIMIItlllllllllllltllllMIIIIMIIIIIIIIIIIIIIIIIINIE 

Db 121 PI LI KYRHSWPENLACKELPVYDRGVCI SPEAI VTADGAD FPMDSSNGNCRGASSERCKC 180 

Qy 181 KP IRATQKTYFRNN YNYVI RAKVKE I KTKCHDVTAWEVKE I LKSSLVNI PRDTVNLYTS 240 

IMIIIIIIIIIIIIIEIIIIIIIIIIIItllMIIIIMIIIIIIIIIIIIIIIIIIII 

Db 181 KP IRATQKTYFRNNYN YVI RAKVKK I KTKCHDVTAWEVKE I LKSSLVN I PRDTVNLYTS 240 

25 Qy 241 SGCLCP PLNVNEEY 1 1 MGYEDEERSRLLLVEGS I AE KWKDRLGKKVKRWDMKLRHLGLS K 300 

lllllllllllllllllllllltllllltlllllllllllMIIIIIIIIIIIIMIIII 
Db 241 SGCLCPPLNVNEEY 1 1 MGYEDEERSRLLLVEGS I AEKWKDRLGKKVKRWDMKLRHLGLSK 300 

_ _ Qy 301 SDSSNSDSTQSQKSGRNSNPRQARN 325 

30 || | mil || IM | mil MM II 

Db 301 SDSSNSDSTQSQKSGRNSNPRQARN 325. 



De Robertis also discloses a complex comprising a substantially pure frzb-1 
protein complexed with at least one Wnt protein (claim 12, page 26). Accordingly, De 
35 Robertis discloses a complex comprising a substantially pure frzb-1 protein comprising 
the amino acid sequence of SEQ ID NO: 9 complexed with at least one Wnt protein. 

Claim Rejections - 35 USC § 112 

The following is a quotation of the second paragraph of 35 U.S.C. 112: 

40 The specification shall conclude with one or more claims particularly pointing out and distinctly 

claiming the subject matter which the applicant regards as his invention. 

Claim 12 is rejected under 35 U.S.C. 1 12, second paragraph, as being indefinite 
for failing to particularly point out and distinctly claim the subject matter which applicant 
45 regards as the invention. 

The present specification discloses that "substitutional, deletional, or insertional 
mutants of the novel polypeptides may be prepared by in vitro or recombinant methods 
and screened for immuno-crossreactivity with cerberus, frzb-1, or PAPC and for cerberus 
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antagonist or agonist activity" (page 5, lines 31-35). Hence, it is unclear how to construe 
the term "frzb-l protein" because it is unclear if "substitutional, deletional, or insertional 
mutants" are encompassed by the term "frzb-l protein." The metes and bounds are not 
clearly set forth. 

5 

Conclusion 



No claims are allowable. 



Any inquiry concerning this communication or earlier communications from the examiner should be 

DIRECTED TO DAVID S. ROMEO WHOSE TELEPHONE NUMBER IS (571 ) 272-0890. THE EXAMINER CAN NORMALLY BE REACHED ON 

1 0 Monday through Friday from 7:30 a.m. to 4:00 p.m. If attempts to reach the examiner by telephone are 
unsuccessful, the examiner's supervisor, gary kunz, can be reached on (571) 272-0887. 

IF SUBMITTING OFFICIAL CORRESPONDENCE BY FAX, APPLICANTS ARE ENCOURAGED TO SUBMIT OFFICIAL 

correspondence to the following tc 1600 before and after final rlghtfax numbers: 
Before Final (703) 872-9306 
1 5 After Final (703) 872-9307 

Customers are also advised to use Certificate of Facsimile procedures when submitting a reply to a 
non-final or final Office action by facsimile (see 37 CFR 1 .6 and 1 .8). 

Faxed draft or informal communications should be directed to the examiner at (571 ) 273-0890. 
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Bfn>QDp^M f CARpyjyq and 
EBpuMt EM>yciPfi factors 



5 Field of the Invention 

The Invention generally relates to growth 
factors , neurotrophic factors, and their Inhibitors, and 
more particularly to several new growth factors with 
neural, endodermal, and cardiac tissue inducing 
10 activity, to complexes and compositions including the 
factors, and to DNA or RNA coding sequences for the 
factors. Further, one of the novel growth factors 
should be useful in tumor suppression gene therapy. 

This application claims the benefit of U.S. 
15 Provisional Application Ho. 60/020,150, filed June 20, 
1996. 

This invention was made with Government 
support under grant contract number HD-21502, awarded by 
the National Institutes of Health. The Government has 
20 certain rights in this invention. 

Background of the Invention 

Growth factors are substances, such as 
polypeptide hormones, which affect the growth of defined 
populations of animal cells in vivo or in vitro, but 
25 which are not nutrient substances. Proteins involved in 
the growth and differentiation of tissues may promote or 
inhibit growth, and promote or inhibit differentiation, 
and thus the general term "growth factor" includes 
cytokines, trophic factors, and their inhibitors. 
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Widespread neuronal cell death accompanies 
normal development of the central and peripheral nervous 
systems. Studies of peripheral target tissues during 
development have shown that neuronal cell death results 
5 from the competition among neurons for limiting amounts 
of survivor factors ( "neurotrophic factors H ) . The 
earliest identified of these, nerve growth factor 
("NGF"), is the most fully characterized and has been 
shown to be essential for the survival of sympathetic 

10 and neural crest-derived sensory neurons during early 
development of both chick and rat. 

One family of neurotropic factors are the 
Wnts, which have dorsal axis-inducing activity. Most of 
the Wnt proteins are bound to cell surfaces. (See, 

15 e.g., Sokol et al.. Science, 249, pp. 561-564, 1990.) 
Dorsal axis-inducing activity in Xenopus embryos by one 
member of this family (Xwnt-8) was described by Smith 
and Harland in 1991, Cell, 67, pp. 753-765. The authors 
described using RNA injections as a strategy for 

20 identifying endogenous RNAs involved in dorsal 
patterning to rescue dorsal development in embryos that 
were ventralized by UV irradiation. 

Another member of the growth and neurotropic 
factor family was subsequently discovered and described 

25 by Harland and Smith, which they termed "noggin." 
(Cell, 70, pp. 829-840 (1992).) Noggin is a good 
candidate to function as a signaling molecule in 
Nieuwkoop's center, by virtue of its maternal 
transcripts, and in Spemann's organizer, through its 

30 zygotic organizer-specific expression. Besides noggin, 
other secreted factors may be involved in the organizer 
phenomenon. 

Another Xenopus gene designated "chordin" that 
begins to be expressed in Spemann's organizer and that 
35 can completely rescue axial development in ventralized 
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embryos was described by Sasai et al., Cell, 79, pp. 
779-790, 1994. In addition to dorsalizing mesoderm, 
chordin has the ability to induce neural tissue and its 
activities are antagonized by Bone Horphogenetic 
5 Protein-4 (Sasai et al., Nature, 376, pp. 333-336, 
1995). 

Therefore , the dorsal lip or Spemann 1 s 
organizer of the Xenopus embryo is an ideal tissue for 
seeking novel growth and neurotrophic factors • New 

10 growth and neurotrophic factors are useful agents, 
particularly those that are secreted due to their 
ability to be used in physiologically active, soluble 
forms because these factors, their receptors, and DNA or 
RNA coding sequences therefore and fragments thereof are 

15 useful in a number of therapeutic, clinical, research, 
diagnostic, and drug design applications. 

SuimwnT*Y ? f the Invention 

In one aspect of the present invention, the 
sequence of the novel peptide that can be in 

20 substantially purified form is shown by SEQ id N0:1. 
The Xenopus derived SEQ ID N0;1 has been designated 
n cerberus, N and this peptide is capable of inducing 
endodermal, cardiac, and neural tissue development in 
vertebrates when expressed. The nucleotide sequence 

25 which, when expressed results in cerberus, is 
illustrated by SEQ ID NO: 2. Since peptides of the 
invention induce endodermal, cardiac, and neural tissue 
differentiation in vertebrates, they should be able to 
be prepared in physiologically active form for a number 

30 of therapeutic, clinical, and diagnostic applications. 

Cerberus was isolated during a search for 
molecules expressed specifically in Spemann 's organizer 
containing a secretory signal sequence. In addition to 
cerberus, two other novel cDNAs were identified. 
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The Xenopus derived peptide that can be 
deduced from SEQ ID NO: 3 encodes a novel protein we had 
earlier designated as "frazzled," a secreted protein of 
318 amino acids that has dorsalizing activity in xenopus 
5 embryos. We now designate the novel protein as 
"frzb-1." The gene for frzb-1 is expressed in many 
adult tissues of many animals, three of the cDNAs 
(Xenopus, mouse, and human) have been cloned by us. The 
accession numbers for the Xenopus, mouse, and human 

10 frzb-1 cDNA sequences of the gene now designated frzb-1 
are U68059, U68058, and U68057, respectively. Frzb-1 
has some degree of sequence similarity to the Drosophila 
gene frizzled which has been shown to encode a seven* 
transmembrane protein that can act both as a signalling 

15 and as a receptor protein (Vinson et al., Nature, 338, 
pp. 263-264, 1989; Vinson and Adler, Nature, 329, pp. 
549-551, 1987). Vertebrate homologues of Frizzled have 
been isolated and they too were found to be anchored to 
the cell membrane by seven membrane spanning domains 

20 (Hang et al., J. Biol. Chem., 271, pp. 4468-4476, 1996). 
Frzb-1 differs from the frizzled proteins in that it is 
an entirely soluble, diffusible secreted protein and 
therefore suitable as a therapeutic agent. The 
nucleotide sequence derived from Xenopus that, when 

25 expressed, results in frzb-1 protein is illustrated by 
SEQ ID NO: 4. The frzb-1 protein derived from mouse is 
shown as SEQ ID NO: 7, while the mouse frzb-1 nucleotide 
sequence is SEQ ID NO: 8. The human derived frzb-1 
protein is illustrated by SEQ ID NO: 9, and the human 

30 frzb-1 nucleotide sequence is SEQ ID NO: 10. 

Frzb-1 is an antagonist of Wnts in vivo, and 
thus is believed to find utility as a tumor suppressor 
gene, since overexpressed Wnt proteins cause cancer. 
Frzb-1 may also be a useful vehicle for solubilization 
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and therapeutic delivery of Wnt proteins complexed with 
it. 

The final cDNA isolated containing a signal 
sequence results in a peptide designated Paraxial 
5 Protocadherin (PAPC). The cDNA for PAPC is a divergent 
member of the cadherin multigene family. PAPC is most 
related to protocadherin 43 reported by Sano et al., The 
EMBO J • , 12, pp. 2249-2256, 1993. As shown in SEQ ID 
NO: 5, the PAPC gene encodes a transmembrane protein of 

10 896 amino acids, of which 187 are part of an 
intracellular domain. PAPC is a cell adhesion molecule, 
and microinjection of PAPC mRNA constructs into Xenopus 
embryos suggest that PAPC acts as a molecule involved in 
mesoderm differentiation. A soluble form of the PAPC 

15 extracellular domain is able to block muscle and 
mesoderm formation in Xenopus embryos. The nucleotide 
sequence encoding Xenopus PAPC is provided in SEQ ID 
NO: 6. 

Cerberus, frzb-1, or PAPC or fragments thereof 

20 (which also may be synthesized by in vitro methods) may 
be fused (by recombinant expression or in vitro covalent 
methods) to an immunogenic polypeptide and this, in 
turn, may be used to immunize an animal in order to 
raise antibodies against the novel proteins. Antibodies 

25 are recoverable from the serum of immunized animals. 
Alternatively, monoclonal antibodies may be prepared 
from cells from the immunized animal in conventional 
fashion. Immobilized antibodies are useful particularly 
in the diagnosis (in vitro or in vivo) or purification 

30 of cerberus, frzb-1, or PAPC 

Substitutional, deletional, or insertional 
mutants of the novel polypeptides may be prepared by in 
vitro or recombinant methods and screened for immuno- 
crossreactivity with cerberus, frzb-1, or PAPC and for 

35 cerberus antagonist or agonist activity. 
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Cerberus or frzb-1 also may be derivatized in 
vitro in order to prepare immobilized and labelled 
proteins, particularly for purposes of diagnosis of 
insufficiencies thereof, or for affinity purification of 
5 antibodies thereto. 

Among applications for the novel proteins are 
tissue replacement therapy and, because frzb-1 is an 
antagonist of Wnt signaling, tumor suppression 
therapies. The cerberus receptor may define a novel 
10 signalling pathway, in addition, frzb-1 could permit 
the isolation of novel members of the Wnt family of 
growth factors. 



Brief pgggyjpttoff Qf tire Prawingg 

Figure 1 illustrates the amino acid sequence 
15 (SEQ ID NO : 1 ) of the Fig. 2 cDNA clone for cerberus; 

Figure 2 illustrates a cDNA clone (SEQ ID 
N0:2) for cerberus derived from Xenopus. Sense strand 
is on top (5' to 3' direction) and the antisense strand 
on the bottom line (in the opposite direction); 
20 Figures 3 and 4 show the amino acid and 

nucleotide sequence, respectively, of full-length frzb-1 
from Xenopus (SEQ ID N0S:3 and 4); 

Figures 5 and 6 show the amino acid and 
nucleotide sequence, respectively, of full-length PAPC 
25 from Xenopus (SEQ ID H0S:5 and 6); 

Figures 7 and 8 show the amino acid and 
nucleotide sequence, respectively, of full-length frzb-1 
from mouse (SEQ ID N0S:7 and 8); and 

Figures 9 and 10 show the amino acid and 
30 nucleotide sequence, respectively, of full-length frzb-1 
from human (SEQ ID NOS:9 and 10). 
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Detailed Description of the Preferred Embodiments 

Among the several novel proteins and their 
nucleotide sequences described herein, is a novel 
endodermal, cardiac, and neural inducing factor in 
5 vertebrates that we have named "cerberus. " When 
referring to cerberus, the present invention also 
contemplates the use of fragments, derivatives, 
agonists, or antagonists of cerberus molecules. Because 
cerberus has no homology to any reported growth factors, 

10 it is proposed to be the founding member of a novel 
family of growth factors with potent biological 
activities, which may be isolated using SEQ id NO: 2. 

The amphibian organizer consists of several 
cell populations with region-specific inducing 

15 activities. On the basis of morphogenetic movements, 
three very different cell populations can be 
distinguished in the organizer. First, cells with 
crawling migration movements involute, fanning out to 
form the prechordal plate. Second, cells involute 

20 through the dorsal lip driven by convergence and 
extension movements, giving rise to the notochord of the 
trunk. Third, involution ceases and the continuation of 
mediolateral intercalation movements leads to posterior 
extension movements and to the formation of the tail 

25 notochord and of the chordoneural hinge. The three cell 
populations correspond to the head, trunk, and tail 
organizers, respectively. 

The cerberus gene is expressed at the right 
time and place to participate in cell signalling by 

30 Spemann's organizer. Specifically, cerberus is 
expressed in the head organizing region that consists of 
crawling-migrating cells ♦ The cerberus expressing 
region corresponds to the prospective foregut, including 
the liver and pancreas anlage, and the heart mesoderm. 
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Cerberus expression is activated by chordin, noggin, and 
organizer-specific homeobox genes. 

Our studies were conducted in early embryos of 
the frog Xenopus laevis. The frog embryo is well suited 
5 to experiments, particularly experiments pertaining to 
generating and maintaining regional differences within 
the embryo for determining roles in tissue differentia- 
tion. It is easy to culture embryos with access to the 
embryos even at very early stages of development 

10 (preceding and during the formation of body pattern and 
differentiation) and the embryos are large. The initial 
work with noggin and chordin also had been in Xenopus 
embryos, and, as predicted, was highly conserved among 
vertebrates. Predictions based on work with Xenopus as 

15 to corresponding human noggin were proven true and the 
ability to clone the gene for human noggin was readily 
accomplished. (See the description of Xenopus work and 
cloning information in PCT application, published March 
17, 1994, WO 9 405 800, and the subsequent human cloning 

20 based thereon in the PCT application, also published 
March 17, 1994, as WO 9 405 791.) 

The cloning of cerberus, frzb-1, and PAPC 
resulted from a comprehensive screen for cDNAs enriched 

25 in Spemann's organizer. Subtract! ve differential 
screening was performed as follows. In brief, poly A* 
RMA was isolated from 300 dorsal lip and ventral 
marginal zone (VMZ) explants at stage 10%. After first 
strand cDNA synthesis approximately 70-80% of common 

30 sequences were removed by substraction with biotinylated 
VMZ poly A* RNA prepared from 1500 ventral gastrula 
halves. For differential screening, duplicate filters 
(2000 plaques per 15 cm plate, a total of 80,000 clones 
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screened) of an unamplified oriented dorsal lip library 
were hybridized with radiolabeled dorsal lip or VMZ 
cDHA* Putative organizer-specific clones were isolated, 
grouped by sequence analysis from the 5' end and whole- 
5 mount in situ hybridization, and subsequently classified 
into known and new dorsal-specific genes. Rescreening 
of the library (100,000 independent phages) with a 
cerberus probe resulted in the isolation of 45 
additional clones, 31 of which had similar size as the 
10 longest one of the 11 original clones indicating that 
they were presumably full-length cDNAs . The longest 
cDNAs for cerberus, frzb-1, and PAPC were completely 
sequenced. 

To explore the molecular complexity of 
15 Spemann's organizer we performed a comprehensive 
differential screen for dorsal-specific cdnas. The 
method was designed to identify abundant cDNAs without 
bias as to their function. As shown in Table 1, five 
previously known cDNAs and five new ones were isolated, 
20 of which three (expressed as cerberus, frzb-1, and PAPC, 
respectively) had secretory signal sequences. 
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Previously Known Genes Qene Product No. of Isolates 

Chordin novel secreted protein 70 

Goosecoid homeobox gene 3 

5 Hntallavis/XFKH-1 16rkheadAranscriptmn factor 2 

Xnot-2 homeobox gene 1 

XUm-1 homeobox gene 1 

New Genes 

Cerberus novel secreted protein 1 1 

10 PAPC cadherin-like/transmembrane 2 

Frzb-1 novel secreted protein 1 

Sox-2 sryAranscription factor 1 

Fkh-tike forkhead/transcription factor 1 



The most abundant dorsal-specific cONA was 

15 chordin (chd), with 70 independent isolates. The second 
most abundant cDNA was isolated 11 times and named 
cerberus (after a mythological guardian dog with 
multiple heads ) . The cerberus cDNA encodes a putative 
secreted polypeptide of 270 amino acids, with an amino 

20 terminal hydrophobic signal sequence and a carboxy 
terminal cysteine-rich region (Fig. 1). Cerberus is 
expressed specifically in the head organizer region of 
the Xenopus embryo, including the future foregut. 

An abundant mRNA found in the dorsal region of 

25 the Xenopus gastrula encodes the novel putative secreted 
protein we have designated as cerberus. Cerberus mRNA 
has potent inducing activity in Xenopus embryos, leading 
to the formation of ectopic heads. Unlike other 
organizer-specific factors, cerberus does not dorsalize 

30 mesoderm and is instead an inhibitor of trunk-tail 
mesoderm. Cerberus is expressed in the anterior-most 
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domain of the gastrula Including the leading edge of the 
deep layer of the dorsal lip a region that, as shown 
here, gives rise to foregut and midgut endoderm. 
Cerberus promotes the formation of cement gland, 
5 olfactory placodes, cyclopic eyes, forebrain, and 
duplicated heart and liver (a foregut derivative ) • 
Because the pancreas is also derived from this foregut 
region, it is likely that cerberus induces pancreas in 
addition to liver. The expression pattern and inducing 

10 activities of cerberus suggest a role for a previously 
neglected region of the embryo, the prospective foregut 
endoderm, in the induction of the anterior head region 
of the embryo. 

Turning to Fig. 1, Xenopus cerberus encodes a 

15 putative secreted protein transiently expressed during 
embryogenesis and the deduced amino acid sequence of 
Xenopus cerberus is shown. The signal peptide sequence 
and the nine cysteine residues in the carboxy-terminus 
are indicated in bold. Potential N-linked glycosylation 

20 sites are underlined. In database searches the cerberus 
protein showed limited similarity only to the mammalian 
Dan protein, a possible tumor suppressor proposed to be 
a DNA-binding protein. 

Cerberus appears to be a pioneer protein, as 

25 its amino acid sequence and the spacing of its 
9 cysteine residues were not significantly similar to 
other proteins in the databases (NCBI-Gen Bank release 
93.0). We conclude that the second most abundant 
dorsal-specific cDNA encodes a novel putative secreted 

30 factor, which should be the founding member of a novel 
family of growth factors active in cell differentiation. 

Cerberus Demarcates an Anterior Organizer 
Domain . Cerberus mRNA is expressed at low levels in the 
unfertilized egg, and zygotic transcripts start 

35 accumulating at early gastrula. Expression continues 
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during gastrula and early neurula, rapidly declining 
during neurulation. importantly , cerberus expression 
starts about one hour after that of chd, suggesting that 
cerberus could act downstream of the chd signal. 
5 Whole-mount in situ hybridizations reveal that 

expression starts in the yolky endomesodermal cells 
located in the deep layer of the organizer. The 
cerberus domain includes the leading edge of the most 
anterior organizer cells and extends into the lateral 

10 mesoderm. The leading edge gives rise to liver, 
pancreas, and foregut in its midline, and the more 
lateral region gives rise to heart mesoderm at later 
stages of development. 

Fig. 2 sets out the sequence of a full length 

15 Xenopus cdna for cerberus. 

This entirely new molecule has demonstrated 
physiological properties that should prove useful in 
therapeutic, diagnostic, and clinical applications that 
require regeneration, differentiation, or repair of 

20 tissues, such wound repair, neuronal regenerational or 
transplantation , supplementation of heart muscle 
differentiation, differentiation of pancreas and liver, 
and other applications in which cell differentiation 
processes are to be induced. 

25 The second, novel, secreted protein we have 

discovered is called M frzb-1," which was shown to be a 
secreted protein in Xenopus oocyte microin j ec t ion 
experiments. Thus it provides a natural soluble form of 
the related extracellular domains of Drosophila and 

30 vertebrate frizzled proteins. We propose that the 
latter proteins could be converted into active soluble 
forms by introducing a stop codon before the first 
transmembrane domain. We have noted that the cysteine- 
rich region of frzb-1 and frizzled contains some overall 

35 structural homology with Wnt proteins using the Profile 
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Search homology program (Gribskov, Meth. Bnzymol., 183, 
pp. 146-159, 1990). This had raised the interesting 
possibility that frzb-1 could interact directly with Wnt 
growth factors in the extracellular space. This was 
5 because we had found that when microinjected into 
Xenopus embryos, frzb-1 constructs have moderate 
dorsalizing activity, leading to the formation of 
embryos with enlarged brain and head, and shortened 
truck. Somatic muscle differentiation, which requires 

10 Xwnt-8, was inhibited. In the case of frzb-1, an 
attractive hypothesis, suggested by the structural 
homologies, was that it may act as an inhibitor of 
Wnt-8, a growth factor that has ventralizing activity in 
the Xenopus embryo (Christian and Moon, Genes Dev., 7, 

15 pp. 13-28, 1993). We have shown that frzb-1 can 
interact with Xwnt-8 and Wnt-1, and it is expected that 
it could also interact with other members of the Wnt 
family of growth factors, of which at least 15 members 
exist in mammals. In addition, a possible interaction 

20 with Wnts was suggested by the recent discovery that 
dishevelled, a gene acting downstream of wingless, has 
strong genetic interaction with frizzled mutants in 
Drosophila (Krasnow et al., Development, 121, pp. 4095- 
4102, 1995). This possibility has been explored in 

25 depth (Leyns et al.. Cell, 88, pp. 747-756, March 21, 
1997), because a soluble antagonist of the Wnt family of 
proteins is expected to be of great therapeutic value. 
Examples 1 and 2 illustrate tests that show antagonism 
of Xwnt-8 by binding to frzb-1. 

30 vertebrate homologues of Frizzled have been 

isolated and they too are anchored to the cell membrane 
by seven membrane spanning domains (Wang et al., 
J. Biol. Chem., 271, pp. 4468-4476, 1996). Frzb-1 
differs from the frizzled proteins in that it is an 

35 entirely soluble, diffusible secreted protein and 
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therefore suitable as a therapeutic agent. The 
nucleotide sequence that when expressed results in 
frzb-1 protein is illustrated by SEQ ID N0:4. 

SEQ ID NO: 4 corresponds to the Xenopus 
5 homolog, but by using it in BLAST searches (and by 
cloning mouse frzb-1) we had been able to assemble the 
sequence of the entire mature human frzb-1 protein, SEQ 
ID NO: 9 . Indeed, human frzb-1 is encoded in six 
expressed sequence tags (ESTs) available in Genebank. 

10 The human frzb-1 sequence can be assembled by 
overlapping in the 5' to 3' direction the ESTs with the 
following accession numbers in Genebank: B18848, 
R63748, W38677, W44760, H38379, and N71244. No function 
had yet been assigned to these EST sequences, but we 

15 believe and thus propose here that human frzb-1 will 
have similar functions in cell differentiation to those 
described above for Xenopus frzb-1. The nucleotide 
sequence of human frzb-1 is shown in SEQ ID NO: 10. The 
mouse frzb-1 protein and nucleotide sequences are 

20 provided by SEQ ID N0S:7 and 8, respectively. 

In particular, we believe that frzb-1 will 
prove useful in gene therapy of human cancer cells. In 
this rapidly developing field, one approach is to 
introduce vectors expressing anti-sense sequences to 

25 block expression of dominant ocogenes and growth factor 
receptors. Another approach is to produce episomal 
vectors that will replicate in human cells in a 
controlled fashion without transforming the cells. For 
an example of the latter (an episomal expression vector 

30 system for human gene therapy), reference is made to 
IKS. Patent 5,624,820, issued April 29, 1997, inventor 
Cooper* 

Gene therapy now includes uses of human tumor 
suppression genes. For example, U.S. Patent 5,491,064, 
35 issued February 13, 1996, discloses a tumor suppression 
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gene localized on chromosome 11 and described as 
potentially useful for gene therapy in cancers deleted 
or altered in their expression of that gene. Frzb-1 
maps to chromosome 2q31-33 and loss of one copy of the 
5 2q31-33 and loss of one copy of the 2q arm has been 
observed with high incidence in lung carcinomas, 
colo-rectal carcinomas, and neuroblastomas, which has 
lead to the proposal that the 2q arm carries a tumor 
suppressor gene* We expect frzb to be a tumor 

10 suppressor gene, and thus to be useful in tumor 
suppression applications. 

A number of applications for cerberus and 
frzb-1 are suggested from their pharmacological 
(biological activity) properties. 

15 For example, the cerberus and frzb-1 cDNAs 

should be useful as a diagnostic tool (such as through 
use of antibodies in assays for proteins in cell lines 
or use of oligonucleotides as primers in a PCR test to 
amplify those with sequence similarities to the 

20 oligonucleotide primer, and to determine how much of the 
novel protein is present). 

Cerberus, of course, might act upon its target 
cells via its own receptor. Cerberus, therefore, 
provides the key to isolate this receptor. Since many 

25 receptors mutate to cellular oncogenes, the cerberus 
receptor should prove useful as a diagnostic probe for 
certain tumor types. Thus, when one views cerberus as 
ligand in complexes, then complexes in accordance with 
the invention include antibody bound to cerberus, 

30 antibody bound to peptides derived from cerberus, 
cerberus bound to its receptor, or peptides derived from 
cerberus bound to its receptor or other factors. Mutant 
forms of cerberus, which are either more potent agonists 
or antagonists, are believed to be clinically useful. 
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Such complexes of cerberus and its binding protein 
partners will find uses in a number of applications. 

Practice of this invention includes use of an 
oligonucleotide construct comprising a sequence coding 
5 for cerberus or frzb-1 and for a promoter sequence 
operatively linked in a mammalian or a viral expression 
vector. Expression and cloning vectors contain a 
nucleotide sequence that enables the vector to replicate 
in one or more selected host cells. Generally, in 

10 cloning vectors this sequence is one that enables the 
vector to replicate independently of the host 
chromosomes, and includes origins of replication or 
autonomously replicating sequences. The well-known 
plasmid pBR322 is suitable for most gram negative 

15 bacteria, the 2y plasmid origin for yeast and various 
viral origins (SV40, polyoma, adenovirus, VSV or BPV) 
are useful for cloning vectors in mammalian cells. 

Expression and cloning vectors should contain 
a selection gene, also termed a selectable marker. 

20 Typically, this is a gene that encodes a protein 
necessary for the survival or growth of a host cell 
transformed with the vector. The presence of this gene 
ensures that any host cell which deletes the vector will 
not obtain an advantage in growth or reproduction over 

25 transformed hosts. Typical selection genes encode 
proteins that (a) confer resistance to antibiotics or 
other toxins, e.g. ampicillin, neomycin, methotrexate or 
tetracycline, (b) complement auxotrophic deficiencies. 

Examples of suitable selectable markers for 

30 mammalian cells are dihydrofolate reductase ( DHFR ) or 
thymidine kinase. Such markers enable the identifica- 
tion of cells which were competent to take up the 
cerberus nucleic acid. The mammalian cell transformants 
are placed under selection pressure which only the 

35 transformants are uniquely adapted to survive by virtue 
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of having taken up the marker. Selection pressure is 
imposed by culturing the trans formants under conditions 
in which the concentration of selection agent in the 
medium is successively changed. Amplification is the 

5 process by which genes in greater demand for the 
production of a protein critical for growth are 
reiterated in tandem within the chromosomes of 
successive generations of recombinant cells. Increased 
quantities of cerberus or frzb-1 can therefor be 

0 synthesized from the amplified DNA. 

For example, cells transformed with the DHFR 
selection gene are first identified by culturing all of 
the transformants in a culture medium which contains 
methotrexate (Mtx), a competitive antagonist of DHFR. 

5 An appropriate host cell in this case is the Chinese 
hamster ovary ( CH0 ) cell line deficient in DHFR 
activity, prepared and propagated as described by Urlaub 
and Chasin, Proc. Nat. Acac. Sci., 77, 4216 (1980). The 
transformed cells then are exposed to increased levels 

0 of Mtx. This leads to the synthesis of multiple copies 
of the DHFR gene and, concomitantly, multiple copies of 
other DNA comprising the expression vectors, such as the 
DNA encoding cerberus or frzb-1. Alternatively, host 
cells transformed by an expression vector comprising DNA 

5 sequences encoding cerberus or frzb-1 and aminoglycoside 
3' phosphotransferase (APH) protein can be selected by 
cell growth in medium containing an aminoglycosidic 
antibiotic such as kanamycin or neomycin or G418. 
Because eukaryotic cells do not normally express an 

0 endogenous APH activity, genes encoding APH protein, 
commonly referred to as neo resistant genes, may be used 
as dominant selectable markers in a wide range of 
eukaryotic host cells, by which cells transformed by the 
vector can readily be identified. 
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Expression vectors, unlike cloning vectors, 
should contain a promoter which is recognized by the 
host organism and is operably linked to the cerberus 
nucleic acid. Promoters are untranslated sequences 
5 located upstream from the start codon of a structural 
gene (generally within about 100 to 1000 bp) that 
control the transcription and translation of nucleic 
acid under their control. They typically fall into two 
classes, inducible and constitutive. Inducible 

10 promoters are promoters that initiate increased levels 
of transcription from DNA under their control in 
response to some change in culture conditions, e.g. the 
presence or absence of a nutrient or a change in 
temperature. At this time a large number of promoters 

15 recognized by a variety of potential host cells are well 
known • These promoters can be operably linked to 
cerberus encoding DNA by removing them from their gene 
of origin by restriction enzyme digestion, followed by 
insertion 5* to the start codon for cerberus or frzb-1. 

20 Nucleic acid is operably linked when it is 

placed into a functional relationship with another 
nucleic acid sequence. For example, DNA for a 
presequence or secretory leader is operably linked to 
DNA for a polypeptide if it is expressed as a preprotein 

25 which participates in the secretion of the polypeptide; 
a promoter or enhancer is operably linked to a coding 
sequence if it affects the transcription of the 
sequence; or a ribosome binding site is operably linked 
to a coding sequence if it is positioned so as to 

30 facilitate translation. Generally, operably linked 
means that the DNA sequences being linked are contiguous 
and, in the case of a secretory leader, contiguous and 
in reading phase. Linking is accomplished by ligation 
at convenient restriction sites. If such sites do not 
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exit then synthetic oligonucleotide adapters or linkers 
are used in accord with conventional practice. 

Transcription of the protein-encoding DNA in 
mammalian host cells is controlled by promoters obtained 
5 from the genomes of viruses such as polyoma, cytomegalo- 
virus, adenovirus, retroviruses, hepatitis -B virus, and 
most preferably Simian Virus 40 (SV40), or from 
heterologous mammalian promoters, e.g. the actin 
promoter. Of course, promoters from the host cell or 

10 related species also are useful herein. 

Cerberus and frzb-1 are clearly useful as a 
component of culture media for use in culturing cells, 
such as endodermal, cardiac, and nerve cells, in vitro. 
We believe cerberus and frzb-1 will find uses as agents 

15 for enhancing the survival or inducing the growth of 
liver, pancreas, heart, and nerve cells, such as in 
tissue replacement therapy. 

The final CDMA isolated containing a signal 
sequence results in a peptide designated Paraxial 

20 Protocadherin (PAPC). The cDNA for PAPC is a divergent 
member of the cadherin multigene family. PAPC is most 
related to protocadherin 43 reported by Sano et al.. The 
EMBO J. , 12, pp. 2249-2256, 1993. As shown in SEQ ID 
NO: 5, the PAPC gene encodes a transmembrane protein of 

25 896 amino acids, of which 187 are part of an 
intracellular domain* PAPC is a cell adhesion molecule, 
and microinjection of PAPC mRNA constructs into Xenopus 
embryos suggest that PAPC acts in mesoderm 
differentiation. The nucleotide sequence encoding 

30 Xenopus PAPC is provided in SEQ ID NO: 6. 

Therapeutic formulations of the novel proteins 
may be prepared for storage by mixing the polypeptides 
having the desired degree of purity with optional 
physiologically acceptable carriers, excipients or 

35 stabilizers, in the form of lyophilized cake or aqueous 
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solutions- Acceptable carriers, excipients or 
stabilizers are nontoxic to recipients at the dosages 
and concentrations employed, and include buffers such as 
phosphate, citrate, and other organic acids; anti- 
5 oxidants including ascorbic acid; low molecular weight 
(less than about 10 residues) polypeptides; proteins, 
such as serum albumin, gelatin or immunoglobulins. 
Other components can include glycine, blutamine, 
asparagine , arginine , or lysine ; monosaccharides , 

10 disaccharides, and other carbohydrates including 
glucose, mannose, or dextrins; chelating agents such as 
EDTA; sugar alcohols such as mannitol or sorbitol; salt- 
forming counter ions such as sodium; and/or nonionic 
surfactants such as Tween, Pluronics or PEG. 

15 Polyclonal antibodies to the novel proteins 

generally are raised in animals by multiple subcutaneous 
(sc) or intraperitoneal (ip) injections of cerberus or 
frzb-1 and an adjuvant. It may be useful to conjugate 
these proteins or a fragment containing the target amino 

20 acid sequence to a protein which is immunogenic in the 
species to be immunized, e.g., keyhole limpet 
hemocyanin, serum albumin, bovine thyroglobulin, or 
soybean trypsin inhibitor using a bifunctional or 
derivatizing agent, for example, maleimidobenzoyl 

25 sulfosuccinimide ester (conjugation through cysteine 
residues), N-hydroxysuccinimide (through lysine 
residues), glutar aldehyde, succinic anhydride, S0C1 2 , or 
R*N - C ■ NR. 

Animals can be immunized against the immuno- 

30 genie conjugates or derivatives by combining 1 mg or 1 
fig of conjugate (for rabbits or mice, respectively) 
with 3 volumes of Freund's complete adjuvant and 
injecting the solution intradermally in multiple sites. 
One month later the animals are boosted with 1/5 to 1/10 

35 the original amount of conjugate in Fruend's complete 
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adjuvant by subcutaneous injection at multiple sites. 
Seven to 14 days later animals are bled and the serum is 
assayed for anti-cerberus titer. Animals are boosted 
until the titer plateaus. Preferably, the animal is 
5 boosted with the conjugate of the same cerberus or 
frzb-1 polypeptide, but conjugated to a different 
protein and/or through a different cross-linking agent. 
Conjugates also can be made in recombinant cell culture 
as protein fusions. Also, aggregating agents such as 

10 alum are used to enhance the immune response. 

Monoclonal antibodies are prepared by 
recovering spleen cells from immunized animals and 
immortalizing the cells in conventional fashion, e.g. by 
fusion with myeloma cells or by EB virus transformation 

15 and screening for clones expressing the desired 
antibody. 

Antibodies are useful in diagnostic assays for 
cerberus, frzb-1, or PAPC or their antibodies and to 
identify family members. in one embodiment of a 

20 receptor binding assay, an antibody composition which 
binds to all of a selected plurality of members of the 
cerberus family is immobilized on an insoluble matrix, 
the test sample is contacted with the immobilized 
antibody composition in order to adsorb all cerberus 

25 family members, and then the immobilized family members 
are contacted with a plurality of antibodies specific 
for each member, each of the antibodies being 
individually identifiable as specific for a predeter- 
mined family member, as by unique labels such as 

30 discrete fluorophores or the like. By determining the 
presence and/or amount of each unique label, the 
relative proportion and amount of each family member can 
be determined. 

The antibodies also are useful for the 

35 affinity purification of the novel proteins from 
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recombinant cell culture or natural sources. Antibodies 
that do not detectably cross -react with other growth 
factors can be used to purify the proteins free from 
these other family members . 

5 EXAMPLE 1 

Frzb-1 Antagonizes Xwnt-8 Non-Cell Autonomously 

To test whether frzb-1 can antagonize 
secondary axes caused by Xwnt-8 after secretion by 
injected cells, an experimental design was used. Thus, 

10 frzb-1 mRNA was injected into each of the four animal 
blastomeres of eight-cell embryos, and subsequently, a 
single injection of Xwnt-8 mRNA was given to a vegetal- 
ventral blastomere at the 16-32 cell stage. In two 
independent experiments, we found that injection of 

15 frzb-1 alone (n=13) caused mild dorsalization . with 
enlargement of the cement gland in all embryos and that 
injection of Xwnt-8 alone (n-53) lead to induction of 
complete secondary axes in 67% of the embryos. However, 
injection of frzb-1 into animal caps abolished the 

20 formation of complete axes induced by Xwnt-8 (n=27), 
leaving only a residual 14% of embryos with very weak 
secondary axes. The double-injected embryos retained 
the enlarged cement gland phenotype caused by injection 
of frzb-1 mRNA alone. Because both mRNAs encode 

25 secreted proteins and were microinjected into different 
cells, we conclude that the antagonistic effects of 
frzb-1 and Xwnt-8 took place in the extracellular space 
after these proteins were secreted. 
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Membrane-Anchored Wnt-1 Confers Frzb-1 Binding 

To investigate a possible interaction between 
frzb-1 and Writs, the first step was to insert an HA 

5 epitope tag into a Xenopus frzb-1 construct driven by 
the CMV (cytomegalovirus) promoter. Frzbl-HA was tested 
in mRNA microinjection assays in Xenopus embryos and 
found to be biologically active. Conditioned medium 
from transiently transfected cells contained up to 10 

o /ig/ml of Frzbl-HA (quantitated on Western blots using an 
HA- tagged protein standard). 

Transient transfection of 293 cells has been 
instrumental in demonstrating interactions between 
wingless and frizzled proteins. We therefore took 

5 advantage of constructs in which Wnt-1 was fused at the 
amino terminus of CD8, generating a transmembrane 
protein containing biologically active Wnt-1 exposed to 
the extracellular compartment. A WntlCD8 cDNA construct 
(a generous gift of Dr. H. varmus, NIH) was subcloned 

0 into the pcDNA (Invitrogen) vector and transfected into 
293 cells* After incubation with Frzbl-HA-conditioned 
medium (overnight at 37°C), intensely labeled cells were 
observed by immunofluorescence. As a negative control, 
a construct containing 120 amino acids of Xenopus 

5 chordin f an unrelated secreted protein was used. 
Transfection of this construct produced background 
binding of Frzbl-HA to the extracellular matrix, both 
uniform and punctate. Cotrans feet ion of WntlCD8 with 
pcDNA-LacZ showed that transfected cells stained 

0 positively for Frzbl-HA and LacZ. Since WntlCD8 
contains the entire CD8 molecule, a CD 8 cDNA was used as 
an additional negative control. After transfection with 
LacZ and full-length CB8, Frzbl-HA failed to bind to the 
transfected cells. Although most of our experiments 
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were carried out at 37° C, Frzbl-HA-conditioned medium 
also stained WntlCD8-transfected cells after incubation 
at 4°C for 2 hours. 

Attempts to biochemically quantitate the 
5 binding of Frzb-1 to WntlCD8-transfected cells were 
unsuccessful due to high background binding to control 
cultures, presumably due to binding to the extracellular 
matrix. Thus, we were unable to estimate a K D for the 
affinity of the Frzb-1 /Wnt-1 interaction* However, when 

10 serial dilutions of conditioned medium containing 
Frzbl-HA were performed (ranging from 2*5 x 10~ 7 to 1.25 
x 10* 10 M), staining of WntlCD8-transf ected cells was 
found at all concentrations. 

Although we have been unable to provide 

15 biochemical evidence for direct binding between Wnts and 
frzb-1, this cell biological assay indicates that 
Frzbl-HA can bind, directly or indirectly, to Wnt-1 on 
the cell membrane in the 10" 10 M range. 



It is to be understood that while the 
20 invention has been described above in conjunction with 
preferred specific embodiments, the description and 
examples are intended to illustrate and not limit the 
scope of the invention, which is defined by the scope of 
the appended claims. 
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It is Claimed s 

1* A substantially pure protein 
characterized by a physiologically active form and 
comprising an amino acid sequence encoded by the DNA of 
SEQ ID N0:2. 

2. The protein as in claim 1 having 
neurotrophic, growth or differentiation factor activity. 

3. A composition comprising the protein of 
claim 1 and a physiologically acceptable carrier with 
which the peptide is admixed. 

4. An oligonucleotide construct comprising 
a sequence coding for a protein and an expression vector 
operatively linked therewith, the protein having 
neurotrophic, growth or differentiation factor activity 

5 and being expressible from SEQ ID NO: 2. 

5. The construct as in claim 4 wherein the 
expression vector is a mammalian or viral expression 
vector . 

6. A substantially pure protein 
characterized by a physiologically active form and 
comprising an amino acid sequence encoded by the DNA of 
SEQ ID N0:4, SEQ ID N0:8, or SEQ ID NO:10. 

7. The protein as in claim 6 having 
neurotrophic, growth or differentiation factor activity. 

8. A composition comprising the protein of 
claim 6 and a physiologically acceptable carrier with 
which the protein is admixed. 



WO 97/48275 



PCT/US97/10942 



26 

9. An oligonucleotide construct comprising 
a sequence coding for a protein and an expression vector 
operatively linked therewith, the protein being 
expressible from SEQ id NO: 4, SEQ id NO: 8 or SEQ id 

5 NO:10. 

10. The construct as in claim 9 wherein the 
protein is expressible in soluble form* 

11. The construct as in claim 9 wherein the 
expression vector is a mammalian or viral expression 
vector . 

12* A complex comprising a substantially pure 
frzb-1 protein camplexed with at least one Wnt protein* 

13. A substantially pure protein 
characterized by a physiologically active form and 
comprising an amino acid sequence encoded by the DNA of 
SEQ ID NO: 6. 

14 . The protein as in claim 13 having 
mesoderm differentiation activity. 

15. A composition comprising the protein of 
claim 13 and a physiologically acceptable carrier with 
which the protein is admixed. 
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MLEHVLRICI IVCLVNDGAG KHSE6RERTK TYSLNSRGYF 40 

RKERGARRSK ILLVNTKGLD EPHXGHGDFG LVAELFDSTR 80 

THTNRKEPDM NKVKLFSTVA HGNKSARRKA YNGSKRHIFS 120 

RRSFDKRNTE VTEKPGAKMF WNNFLVKMNG APQNTSHGSK 160 

AQEIMKEACK TLPFTQNIVH ENCDRMVIQN NLCFGKCISL 200 

HVPNQQDRRN TCSHCLPSKF TLNHLTLNCT GSKNWKWM 240 

MVEECTCEAH KSNFHQTAQF NMDTSTTLHH 270 
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GAATTCCCAG CAAGTCGCTC AGAAACACTG CAGGGTCTAG ATATCATACA ATGTTACTAA 60 
CTTAAGGGTC GTTCAGOGAG TCTTTGTGAC GTCCCAGATC TATAGTATGT TACAATGATT 

ATGTACTCAG GATCTGTATT ATCGTCTGCC TTGTGAATGA TGGAGCAGGA AAACACTCAG 120 
TACATGAGTC CTAGACATAA TAGCAGACGG AACACTTACT ACCTOGTCCT TTCGTGAGTC 

AAGGACGAGA AAGGACAAAA ACATATTCAC TTAACAGCAG AGGTTACTTC AGAAAAGAAA 180 
TTCCTGCTCT TTCCTGTTTT TGTATAAGTG AATTGTOGTC TCCAATGAAG TCTTTTCTTT 

GAGGAGCACG TAGGAGCAAG AXTCTGCTGG TGAATACTAA AGGTCTTGAT GAACCOCACA 240 
CTCCTCGTGC ATCCTOGTTC TAAGAOGACC ACTTATGATT TCCAGAACTA CTTGGGGTGT 

TTGGGCATGG TGATTTTCGC TTAGTAGCTG AACTATTTGA TTCCACCAGA ACACATACAA 300 
AACCCGTACC ACTAAAAGCG AATCATCGAC TTGATAAACT AAGGTGGTCT TGTGTATGR 

ACAGAAAAGA GCCAGACATG AACAAAGTCA AGCTTTTCTC AACAGTTGCC CATGGAAACA 360 
TGTCTTTTCT CGGTCTGTAC TTGTTTCAGT TCGAAAAGAG TTGTCAACGG GTACCTTTGT 

AAAGTGCAAG AAGAAAAGCT TACAATGGTT CTAGAAGGAA TATTTTTCCT CGOOGttCTT 420 
TTTCACGTTC TTCTTTTOGA ATGTTAOCAA GAtCTTOCTT ATAAAAAGGA GCGGCAAGAA 

TTGATAAAAG AAATACAGAG GTTACTGAAA AGOCTGGTGC CAAGATGTTC TGGAACAATT 480 
AACTATTTTC TTTATGTCTC CAATGACTTT TCGGACCAOG GTTCTACAAG ACCTTGTTAA 

TTTTGGTTAA AATGAATGGA GCCCCACAGA ATACAAGCCA TGGCAGTAAA GCACAGGAAA 540 
AAAACCAATT TTACTTACCT OGGGGTGTCT TATGTTCGGT ACCGTCATTT CGTGTCCTTT 

TAATGAAAGA AGCTTGCAAA ACCTTGTTTT TCACTCAGAA TATTGTACAT GAAAACTGTG 600 
ATTACTTTCT TCGAACGTTT TGGAACAAAA AGTGAGTCTT ATAACATGTA CTTTTGACAC 

ACAGGATGGT GATACAGAAC A A TCTGTGCT TTGGTAAATG CATCTCTCTC CATGTTOCAA €60 
TGTOCtACCA CTATGTCTTG TTAGACAOGA AACCATTTAC GTAGAGAGAG GTACAAGGTT 

ATCAGCAAGA TOGAOGAAAT ACTTGTTCCC ATTGCTTGCC GTCCAAATTT ACCCTGAACC 720 
TAGTOGTTCT A GCT GCTTTA TGAACAAGGG TAAOGAAOGG CAGGTTTAAA TGGGACTTGG 

ACCTGACGCT GAATTGTACT GGATCTAAGA ATGTAGTAAA GGTTGTCATG ATGGTAGAGG 780 
TGGACTGCGA CTTAACATGA CCTAGATTCT TACATCATTT CCAACAGTAC TACCATCTCC 

AATGCACGTG TGAAGCTCAT AAGAGCAACT TCCACCAAAC TGCACAGTTT AACATGGAXA 840 
TTACGTGCAC ACTTCGAGTA TTCTOGTTGA AGGTGGTTTG AOGTGTCAAA TTGTAOCTAT 

CATCTACTAC CCTGCACCAT TAAAGGACTG CCAXACAGTA TGGAAATGCC CTTTTGTTGG 900 
GTAGATGATG GGACGTGGTA ATTTCCTGAC GGTATGTCAT ACCTTTACGG GAAAACAAOC 

AATATTTGTT ACATACTATG CATCTAAAGC ATTATGTTGC CTTCTATTTC AtAXAACCAC 960 
TTATAAACAA TGTATGATAC GTAGATTTCG TAATACAACG GAAGATAAAG TATATTGGTG 

ATGGAATAAG GATTGTATGA ATTATAATTA ACAAATGGCA TTTTGTGTAA CATGCAAGAT 1020 
TACCTTATTC CTAACATACT TAATATTAAT TGTTTAOCGT AAAACACATT GTACGTTCTA 
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CTCTGTTOCA TCAGTTGCAA GATAAAAGGC AATATTTGTT TGACTTWTT TCTACAAAAT 1080 
GAGACAAGGT AGTCAAOGTT CTATTTTCCG TTATAAACAA ACTGAAAAAA AGATGTTTTA 

GAATACCCAA ATATATG&TA AGAXAATGGG GTCAAAACTG TTAAGGGGTA ATGTAATAAT 1140 
CTTATGGGTT TATATACTAT TCTATTAOCC CAGTTTTGAC AATTCCCCAT TACATTATTA 

AGGGACTAAG TTTGCCCAGG AGCAGTGACC CATAACAACC AATC&GCAGG TATGATTTAC 1200 
TCCCTGATTC AAACGGGTCC TCGTCACTGG GTATTGTTGG TTAGTCGTCC ATACTAAATG 

TGGTCACCTG TTTAAAAGCA AACATCRAT TGGTTGCTAT GGGTTACTGC TTCTGGGCAA 1260 
ACCAGTGGAC AAATTTTCGT TTGTAGAAIA AOCAAOGATA CCCAATGACG AAGACCOGTT 

AATGTGTGCC TCATAGGGGG GTTACTGTGT TGTGTACTGA ATAAATTGTA TTTATTTCAT 1320 
TTACACACGG AGTATCCCCC CAATCACACA ACACATGACT TATTTAACAT AAATAAAGTA 

TGTTACAAAA AAAAAAAA 
ACAATGTTTT TTTTTTTT 
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GAATTCCCTT TCACACAGGA CTCCTGGCAG AGGTGAATGG TTAGCCCTAT GGAWTGGTT €0 
CTTAAGGGAA AGTGTGTCCT GAGGACCGTC TCCACTTACC AATCGGGAIA CCTAAACCAA 

TGTTGATTTT GACACATGAT TGAXTGCTTT GAGAXAGGAX TGAAGGACTT GGATTTTTAT 120 
ACAACTAAAA CTGTGXACTA ACTAAOGAAA GTCTATCCIA ACTTCCTGAA CCTAAAAAIA 

CTAATTCTGC ACTTTTAAAT TAXCTGAGTA ATTGTTCATT TTGTATTGGA TGGGACTAAA 1B0 
GATTAAGACG TGAAAATTTA AtAGACTCAT TAACAAGTAA AACATAACCT ACCCTGATTT 

GATAAACTTA ACTCCTTGCT TTTGACTTGC CCATAAACTA TAAGGTGGGG TGAGTTGXAG 240 
CTATTTGAAT TGAGGAAGGA AAACTGAAOG GGTATTTGAT ATTCG&OCCC ACTCAACATC 

TTGCTTTTAC ATGTGCOCAG ATTTTCCCTG TATTCCCIGT ATTCCCTCTA AAGTAAGOCT 300 
AACGAAAATG TACACGGGTC TAAAAGGGAC ATAAGGGACA TAAGGGAGAT TTCATTCGGA 

ACACATACAG GTTGGGGAGA AXAACAATGT CTOGAACAAG GAAAGTGGAC TCATTACTGC 360 
TGTGTATGTC CAACCCGTCT TATTGTTACA GAGCTTGTTC CTTTCACCTG AGTAATGACG 

TACTGGOCAT ACCTGGACTG GCGCTTCTCT TATTAOOCAA TGCTTACTGT GCTTCGT6TG 420 
ATGACCGGTA TGGAOCTGAC CGCGAAGAGA ATAATGGGW ACGAATGACA CGAAGCACAC 

AOOCTGTGCG GATCCCCATG TGCAAATCTA TGCCATGGAA CATGACCAAG ATGCCCAACC 480 
TCGGACACGC CTAGGGGTAG AOGTTTAGAT AOGGTAOCTT GTACT GG TTC TA CG GG TTGG 

ATCTCCACCA CAGCACTCAA GCCAATGCCA TCCTGGCAAS TGAACAGTTT GAAGGTTTGC 540 
TAGAGGTGGT GTCGTGAGTT CGGTTADGGT AGGAOCGTTA ACTTGTCAAA CTTCCAAACG 

TGACCACTGA ATGTAGCCAG GACCTTTTGT TCTTTCTGTG TGCCATGXAX GCCCCCATTT 600 
ACTGGTGACT TACATCGGTC CTGGAAAACA AGAAAGACAC AOGGTACATA CGGGGGTAAA 

GTACCATCGA TTTCCAGCAT GAACCAATTA AGCCT TGCA A GTCOGTGTGC GAAAGGG0CA 660 
CATGGTAGCT AAAGGTOGTA CTTGGTTAAT TCGGAACGTT CAGGCACACG CTTTOOCGGT 

OG0OCGGCIG TGAGCCCATT CTG&EAAAGT ACCGGCACAC TTGGCCAGAG AG C CTGGCAT 720 
CCCGGCCGAC ACTCGGGTAA GAGtATTTCA TGGOOGTGTG AftCOQGTCTC TOGGACCGTA 

CTGAAGMSCT GCOOGTATAT GACAGAGGAG TCTGCATCTC CCCAGAGGCT ATCGTCACAG 780 
CACTTCTCGA CGGGCAXAXA CTGTCTOCT C AGAOGTAGAG GGGTCTOCGA TAGCAGTGTC 

TGGAACXAGG AACAGATTCA ATGCCAGACT TCTOCATGGA TTCAAACAAT GGAAATTGCG 040 
AOCTTGTTCC TTGTCTAAGT TAOGGTCTGA AGAGGTACCT AAGTTTGTTA CCTTTAAOGC 



GAAGOGGCAG GGAGCACTGT AAATGCAAGC CCATGAAGGC AACCCAAAAG ACGTATCTCA 
CTTCGCOGTC CCTCGTGACA TTTACGTTCG GGT A GTTCCG TTGGGTTTTC TGCATAGAGT 



900 



AGAATAATTA CAATTATGTA ATCAGAGCAA AAGTGAAAGA GGTGAAAGTG AAATGOCACG 960 
TCTTATTAAT GTTAATACAT TAGTCTOGTT TTCACTTTCT CCACTTTCAC TTTACGGTGC 



ACGCAACAGC AATTGTGGAA GTAAAGGAGA TTCTCAAGTC TTCCCTAGTG AACATTOCTA 
TGCGTTGTCG TTAACACCTT CATTTCCTCT AAGAGTTCAG AAGGGATCAC TTGTAAGGAT 
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AAGAGACAGT GACACTGTAC AGCAACTCAG GCTG CTTGTG CCOCGAGCTT GTTGCCAATG 1080 
TTCTGTGTCA CTGTGACAXG TCGTTGAGTC CGAOGAACAC GGGGGTCGAA CAACGGTTAC 

AGGAAXACAT AATTATGGGC TATGAAGACA AAGAGCGTAC CAGGCTTCTA CTAGTGGAAG 1140 
TCCTTATGTA TTAATACCCG ATACTTCTGT TTCTCGCATG GTOOGAAGAT GATCAOCTTC 

GATCCTTGGC CGAAAAATGG AGAGATCGTC TTGCXAAGAA AGTCAAGCGC TGGGATCAAA 1200 
CTAQGAACCG GC VTTT TACC TCTCTAGCAG AACGAMCTT TCAGTTCGCG ACCCTAGTTT 

AGCTTCGACG TCOCAGGAAA AGCAAAGAOC CCGTGGCTCC AATTOCCAAC AAAAACAGCA 1260 
TCGAAGCTGC AGGGTCCTW TCGTTTCTGG GGCAOOGAGG TTAAGGGTTG TTTTTGTOGT 

ATTOCAGACA AGCGCGTAGT TAGACCAACG GAAAGGTGTA TGGAAACTCT ATGGACTTTG 1320 
TAAGGTCTGT TCGCGCATCA ATCTGATTGC CTTTCCACAT AOCTTTGAGA TACCTGAAAC 

AAACTAAGAT TTGCATTGTT GGAAGAGCAA AAAAGAAATT GCACXACAGC AOGTTATATT 1380 
TTTGATTCTA AACGTAACAA CCTTCTCGTT TTTTCTTTAA CGTGAXGTOG TGCAATAXAA 

CTATTGTTTA CTACAAGAAG CT GGTTXA GT TGATTGTAGT TCT C CTTTCC TT C TT TTTTT 1440 
GATAACAAAT GATGTTCTTC GAOCAAATCA ACTAACATCA AGAGGAAAGG AAGAAAAAAA 

TTATAACTAT ATTTGCACGT GTTCCCAGGC AATTGTTTTA TTCAACTTOC AGTGACAGAG 1500 
AATATTGATA TAAAOGTGCA CAAGGGTOOG TTAACAAAAT AAGTTGAAGG TCACTGTCTC 

CAGTGACTGA ATGTCTCAGC CXAAAGAAGC TCAATTCATT TCTGATCAAC TAATGGTGAC 1560 
GTCACTGACT TACAGAGTOG GATTTCTTCG AGTTAAGTAA AGACTAGTTG ATTACCACTG 

AAGTGTTTGA TACTTGGGGA AAGTGAACTA ATTGCAATGG TAAATCAGAG AAAAGTTGAC 1620 
TTCACAAACT ATGAACCCCT TTCACTTGAT TAACGTTAOC ATTTAGTCTC TTTTCAACTG 

CAAXGTTGCT TTTCCTGTAG ATGAACAAGT G&GAGATCAC AMTAAATGA TGATCACRT 1680 
GTTAGAAOGA AAAGGACATC TACTT G TTCA CTCTCTAGTG TAAATTTACT ACTAGTGAAA 

CCATTTAATA C TTTCAGCAG TTTTAGTTAG ATG&CAXGTA GGATGCACCT AAATCTAAAT 1740 
GGTAAATTAT GAAAGTCGTC AAAATCAATC TACTGTACAT CCTAOGTGGA TTTAGATTTA 

ATTTTATCAT AAATGAAGAG CTGGTTTAGA CTGTATGGTC ACTGTTGGGA AGGTAAAXGC 1800 
TAAAATAGTA TTTACTTCTC GAOCAAATCT GAGAXAGCAG TGACAACCCT TCCATTTACG 

CTA C TTT G TC AATTCTGTTT TAAAAATTGC CTAAAXAAAT ATTAAGTCCT AAATAAAAAA 1860 
GATGAAACAG TTAAGACAAA ATTTTTAAOG GATTTATTTA TAATTCAGGA 1TEATTTTTT 

AAAAAAAAAA AAAAA 
l ' T T l 'T T TI T T TTTTT 
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KLLLFRAIPM LLLGU4VLQT 
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GAATTCCCAG AGATGAACTC CTTGAGATTG TTTTAAATGA CTGCAGGTCT GGAAGGATTC 60 
CTTAAGGGTC TCTACTTGAG GAACTCTAAC AAAATTTACT GACGTCCAGA CCTTCCTAAG 

ACATTGGGAC ACTGTTTCTA GGCAT GAAAA AACTGCAAGT TTCAACTTTG TTVT TGCT UC 120 
TGTAACGGTG TGACAAAGAT COGTACTTTT TTGAOGTTCA AAGTTGAAAC AAAAACCACG 

AACTTTGATT CTTCAAGATG CTGCTTCTCT TCAGAGCCAT TCCAATGCTG CTGTTGGGAC 180 
TTGAAACTAA GAAGTTCTAC GACGAAGAGA AGTCTCGGTA AGGTTACGAC GACAACCCTG 

TGATGGTTTT ACAAACAGAC TGTGAAATTG CCCAGTACTA CAXAGATGAA fiAJ Ma*vXXT 240 
ACTACCAAAA TGTTTGTCTG ACACTTTAAC GGGTCATGAT GTATCTACTT CTTCTTGGOG 

CTGGCACTGT AATTGCAGT6 TTGTGACAAC ACTCCAIATT TAACACTACA GATATAOCTG 300 
GACCGTGACA TTAACGTCAC AACAGTGTTG TGAGGTAZAA ATTGTGATGT CTAIATGGAC 

CAACCAATTT CCGTCTAATG AAGCAATTTA AIAATTCCCT TATCGGAGTC CGTGAGAGTG 360 
GTTGGTTAAA GGCAGATTAC TTCGTTAAAT TATTAA6GGA AXAGCCTCAG GCACTCTCAC 

ATGGGCAGCT GAGCATCATG GAGAGGAWG ACCGGGAGCA AATCTGCAGG CAGTCCCTTC 420 
TACCCGTOGA CTCGTAGTAC CTCTCCTAAC TGGCCCTCGT TTAGACGTCC GTCAGGGAAG 

ACTGCAACCT GGCTTTGGAT GTGGTCAfiCT TTTCCAAAGG ACACTTCAAG CTTCTGAACG 480 
TGACGTTGGA CCGAAACCTA CACCAGTCGA AAAGGTTTCC TGTCAAGTTC GAAGACTTGC 

TGRAAGTGGA GGTGAGAGAC ATTAATGACC ATAGCCCTCA CTTTCCCAGT GAAATAATGC 540 
ACTTTCACCT CCACTCTCTG TAATTACTGG TAXCGGGAGT GAAAGGGTCA CTTTATTACG 

ATGTGGAGGT GTCTGAAAGT TOCTCTGTGG GCACCAGGAT TCCTTTAGAA ATTGCAATAG 600 
MCACCTCCA CAGACTTTCA AGGAGACACC CGTGGTOCTA AGGAAATCTT TAACGTTATC 

ATGAAGATGT TGGGTCCAAC TCCATCCAGA ACTTTCAGAT CTCAAATAAT AGOCACTTCA 660 
TACTTCTACA AOCCAGGTTG AGGTAGGTCT TGAAAGTCTA GAGTTTATTA TCGGTGAAGT 

GCATTGATGT GCTAACCAGA GCAGATGGGG TGAAATATGC AGATTTAGTC TTAATGAGAG 720 
CGTAACTACA CGATTGGTCT CGTCTACCCC ACTTTATACG TCTAAATCAG AATTACTCTC 

AACMGACAG GGAAA TCCAG OCAACATACA TAATGGAGCT ACTAGCAATG GATGGGGGTG 780 
TTGAOCIGTC CCTTTAGGTC GGTTGTATGT ATTACCTCGA TGATOGTTAC CTAOC00CAC 

TACCATCACT ATCTGGTACT GCAGTGGTtA ACATCCGAGT CCTGGACTTT AATGATAACA 840 
ATGGTAGTGA TAGAOCATGA CGTCACCAAT TGTAGGCTCA GGACCTGAAA TTACTATTGT 

GCCCAGTGTT TGAGAGAAGC ACCATTGCTG TGGACCTAGT AGAGGATGCT CCTCTGGGAT 900 
CGGGTCACAA ACTCTCTTCG TGGTAACGAC AOCTGGATCA TCTCCTACGA GGAGAOOCTA 

ACCWTTGW GGAGTTACAT GCTACTGAOG ATGATGAAGG AGTGAATGGA GAAATTGTTT 960 
TGGAAAACAA CCTCAATGTA OGATGACTGC TACXACTTOC TCACTTACCT CTTTAACAAA 

ATGGATTCAG C&CTTTGGCA TCTCAAGAGG TACGTCAGCT ATTTAAAATT AACTCCAGAA 1020 
TACCTAAGTC GTGAAAOCGT AGAGTTCTCC ATGCAGTCGA TAAATTTTAA TTGAOGTCTT 
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CTGGGAGTGT TACTCTTGAA GGCCAAGTTG ATTTTGA6AC CAAGCAGACT TAOGAATTTC 1060 
GACCGTCACA ATGAGAACTT CCGGTTGAAC TAAAACTCTG GTTCGTCTGA ATGCTTAAAC 

AGGTACAAGC CCAAGAXTTG GGCCCCAACC CACTGACTGC TACTTGTAAA CTAACTCTTC 1140 
TCCATGTTCG GGTTCTAAAC COGGGGTTGG GTGACTGACG ATGAACATTT CATTGACAAG 

ATATACTTGA TGTAAAIGAT AATACCCCAG CCATCACTAT TACCCCTCTG ACTACTGtAA 1200 
TATATGAACT ACATMACTA TTATGGGGTC GGTAGTGAIA ATGGGGAGAC TGATGACATT 

ATGCAGGAGT TGOCTATATT CCAGAAACAG CCACAAAGGA GAACTTTATA GCTCTGATCA 1260 
TAOGTCCTGA AOGGATATAA GGTCTTTGTC GGTGTTTOCT CTTGAAATAT CGAGACTAGT 

GCACTACTGA CAGAGCCTCT GGATCTAATG GACAAGTTCG CTGTACTCTT TATGGACATG 1320 
CGTGATGACT GTCTOGGAGA CCTAGATTAC CTGTTCAAGC GACATGAGAA ATACCTCTAC 

AGCACTTTAA ACTACAGCAA GCTTATGAGG AGAGTTACAT GATAGTTACC ACCTCTACTT 1380 
TCGTGAAAR TGATGTCGTT CGAATACTCC TGTCAATGTA CTATCAATGG TGGAGATGAA 

TAGACAGGGA AAACAIAGCA GCGTACTCTT TGACAGTAGT TGCAGAAGAC CTTGGCTTOC 1440 
ATCTGTCCCT TTTGTATCGT CGCATGAGAA ACTGTCATCA AOGTCTTCTG GAACCGAAGG 

CCTCATTGAA GACCAAAAAG TACTACACAG TGAAGGHAG TGATGAGAAT GACAATGCAC 1500 
GGAGTAACTT CTGGTTTTTC ATGATGTGTC AGTTCCAATC ACTACTCRA CTGTTACGTG 

CTGTAWTTC TAAACCCCAG TATGAAGCTT CTATTCTGGA AAATAATGCT OCAGGCTCTT 1560 
GACAIAAAAG ATTTGGGGTC ATACTTCGAA GATAAGAOCT TTTATYACGA GGTCCGAGAA 

ATATAACTAC AGTGATAGCC AGAGACTCTG ATAGTGATCA AAATGGCAAA GTAAATTACA 1620 
TATATTGATG TGACTATCGG TCTCTGAGAC TATCACTAGT TTTACCGTTT CATTTAATGT 

GACTTGTGGA TGCAAAAGTG ATGGGCCAGT CACTAACAAC ATTTCTTTCT CTTGATGCGG 1680 
CTGAACACCT ACGTTTTCAC TACCCGGTCA GTGATTGTTG TAAACAAAGA GAACTACGCC 

ACTCTGGAGT ATTGAGAGCT GTTAGGTCTT TAGACTATGA AAAACTTAAA CAACTGGATT 1740 
TGAGACCTCA TAACTCTOGA CAATCCAGAA ATCTGATACT TTTTGAATTT GTTGAOCTAA 

TTGAAATTGA AGCTGGAGAC AAXGGGATCC CTCAACTCTC CACTOGOGTT CAACTAAATC 1800 
AACTTTAACT TCGACGTCTG TTXCOCtAGG GAGTTGAGAG GTGAGOGCAA GTTGATTTAG 

TCAGAATAGT TGATCAAAAT GATAATTGOC CTGTGAXAAC TAATCCTCTT CTTAAXAATG 1860 
AGTCTTATCA ACTAGTTTTA CTATTAACGG GACACTATTG ATTAGGAGAA GAATTATTAC 

GCTOGGGTGA AGTTCTGCTT CCGATCAGCG CTCCTCAAAA CTATTTAGTT TTCCAGCTCA 1920 
CGAGOOCACT TGAAGAOGAA GGGTAGTCGC GAGGAGTTTT GATAAATCAA AAGGTOGAGT 

AAGOOGAGGA TTCAGATGAA GGGCACAACT CCCAGCTGTT CTATAOCATA CTGAGAGATC 1980 
TTOGGCTCCT AAGTCTACTT CCOGTGTTGA GGGTOGACAA 6ATATGGTAT GACTCTCTAG 

CAAGCAGATT GTT TGOCATT AACAAAGAAA GTGGTGAAGT GTTOCTGAAA AAACAATTAA 2040 
GTTOGTCTAA CAAAGGGTAA TTGTTTCTTT CACCACTTCA CAAGGACTTT TTTGTTAATT 

ACTCIGAGCA TTCAGAGGAC TTGAGCATAG TAGTTGCAGT GTATGACTTG GGAAGACCTT 2100 
TGAGACTGGT AAGTCTCCTG AACTOGTATC ATCAACGTCA CATACTGAAC CCTTCTGGAA 

CATTATCCAC CAATGCTACA GTTAAATTCA TCCTGAGGGA CTCTTTTCCT TCTAACGTTG 2160 
GTAATAGGTG GTTAOGATGT CAATTTAAGT AGGAGTGGCT GAGAAAAGGA AGATTGCAAC 
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AAGTCGTTAT TTTGCAACCA TCTGCAGAAG AGCAGCACCA GATCGATATG TCCAMMM 2220 
TTCAGCAATA AAAOGTTGGT AGACGTCTTC TCG TC GTGGT CTAGCTAIAC AGGtAAXAXA 

TCATTGCAGT GCTGGCTGGT GGTTGTGCTT TGCTACTTTT GGCG&TCTTT TTTGTGGOCT 2280 
AGTAACGTCA CGACCGACCA CCAACACGAA ACGATGAAAA CCGGTAGAAA AAACACCGGA 

GTACTTGTAA AAAGAAAGCT GGTGAATTTA AGCAGGTACC TGAACAACAC GGAACATGCA 2340 
CATGAACATT TTTCTTTCGA CCACTTAAAT TOGTCCAXGG ACTTGTTGTG CCTTGTACGT 

ATGAAGAACG CCTGTTAAGC AGCCGAXCTC COGAGTCGGT CTCTTC3TCT TTGTCTCAGT 2400 
TACTTCTTGC GGACAATTCG TGGGGTAGAG GGGTCAGCCA GAGAAGAAGA AACAGAGTCA 

CTGAGTCATG CCAACTCTCC ATCAAXACTG AATCTGAGAA TTGCAGCGTG TCCTCTAACC 2460 
GACTCAGTAC GGTTGAGAGG TAGTTATGAC TTAGACTCTT AACGTCGCAC AGGAGATTGG 

AAGAGCAGCA TCAGCAAACA GGCAIAAAGC ACTCCATCTC TGTAOCATCT TATCACACAT 2520 
TTCTCGTCGT AGTOGTTTGT COGTATTTOG TGAGGTAGAG ACATGGTAGA AIAGTGTGTA 

CTGGTTGGCA CCTGGACAAT TGTGCAATGA GCATAAGTGG ACATTCTCAC ATGGGGCACA 2580 
GACCAACCGT GGACCTGTTA ACACGTTACT CGTATTCACC TGTAAGAGTG TACCOOGTGT 

TTAGTACAAA GGTACAGTGG GCAAAGGAGA TAGTGACTTC AATGACAGTG ACTCTGATAC 2640 
AATCATGTTT CCATGTCACC CGY TT CCTCT ATCACTGAAG TTACTGTCAC TGAGACTATG 

TAGTGGAGAA TCAGAAAAGA AGAGCATTGA GCAGCCAATG CAGGCACAAG CCAGTGCTCA 2700 
ATCAOCTCTT AGTCWTTCT TCTCGTAACT CGTCGGTTAC GTCCGTGTTC GGTCACGAGT 

ATACACAGAT GAATCAGCAG GGTTCOGACA TGCCGATAAC TATTTCAGCC ACCGAATCAA 2760 
TATGTGTCTA CTTAGTCGTC CCAAGGCTGT ACGGCTATTG ATAAAGTCGG TGGCTTAGTT 

CAAGGGTCCA GAAAATGGGA ACTGGACATT GCAAXATGAA AAGGGCTAXA GACTGTCTTA 2820 
GTCCCCAGGT CTTTTAOCCT TGACGTGTAA CGTTATACTT TTCCOGATAT CTGAGAGAAT 

CTCTGTAGCT OCTGTATATT ACAATACCTA CCAXGCAAGA ATGCCTAACC TGCACATACC 2680 
GAGACATCGA GGACATATAA TGTTATGGAT GGTAOGTTCT TAOGGATTGG AOGTGTATGG 

GAAOCATAOC CTTAGAGACC CTTATTACCA TATGAAXAAT CCTGTTGCTA ATCGGATGCA 2940 
CTTGGTATGG GAATCTCTGG GAATAATGGT ATAGTTATTA GGACAACGAT TAGOCTAOGT 

GGCGGAATAT GAAAGAGAW TAGtCAACAG AAGTGCAACG TTATCTCCGC AGAGATOGTC 3000 
CCGCCTTATA C TTTC TC TAA ATCRGTTGTC TTCACGTTGC AAXAGAGGOG TCTCTAGCAG 

TAGCAGATAC CAAGAATTGA ATTACAGTCC GCAGATATCA AGACAGCRC ATOCTTCAGA 3060 
ATOGTCTATG GTTCTTAAGT TAATGTCAGG CGTCTAIAGT TCTGTCGAAG TAGGAAGTCT 

AATTGCTACA ACCTTTTAAT CATTAGGCAT GCAAGTGAGA ATGCACAAAG GCAAGTGCTT 3120 
TTAAOGATGT TGGAAAAXtA GTAATCOGTA OGTTCACTCT TAOGTGTRC OGTTCAOGAA 

TAGCATGAAA GCTAAASATA TGGAGTCTCC CCTTTCCCTC TGATGGATGG GGGGAGACAC 3X80 
ATOGTACTTT CGATTTATAT ACCTCAGAGG GGAAAGGGAG ACTAOCIACC CCCCTCTGTG 

AGGACAGTGC ATAAATATAC AGCTGCTTTC TATTTGCATT TGACTTGGGA A TTTTT TG TT 3240 
TCCTGTCACG TATTTATATG TCGACGAAAG AXAAAGGTAA AGTGAAOCCT TAAAAAACAA 

TTTTTtACAT ATTTATTTTT CCTGAATTGA ATGTGACATT GTOCTGTCAC CTAACTAGCA 3300 
AAAAAA3CGTA TAAATAAAAA GGACTTAACT TACACTGTAA CAGGACAGTG GATTGATOGT 
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ATTAAATCCA G&G&OCSAC& GIG&A&X&XT TGAGGGOOOC TG&AACM3CA CASC&5SC&G 3360 
TAATTTAGGT GTCTGGATGT CAGTTTAIAA ACTCCCGGGG AC1TTGTOGT GTAGTCAGTC 

GACCTAAAGT GGCCTTTTTA CTTTTAGCAG CTCCTGGCTC TGCCCTCTGT GY^&ATC&GC 3420 
CTGGATTTCA CCGGAAAAAT GAAAUOGTC GAGGACCCAG ACGGGAGACA CAATTAGTCG 

CCCTGGTCAA GTOCTGAGTA GG&3GAJGGC G I ll t lA TAT GCATCTCACC TACTTTGGAC 3480 
OGGACCAGTT CAGGACXCAT CCTAGTACCG CAAAAATAIA CCTAGAGTGG AI6AAACCTC 

GTGATTTACA CAIAAIAGGA AACGCTTGG? TTCAGTGAAG TCTGTGTTGT AIATATTCTG 3540 
CACTAAATGT GTATTATCCT TTGCGAACCA AAGTCACTTC AGACACAACA TAIATAAGAC 

TTAIATACAC GCATTTTCTG TTTGTGTAIA TATTTCAAGT CCATTCAGAT ATGTGTATAT 3600 
AATAtATGTG CGTAAAACAC AAACACATAT ATAAAGTTCA GGTAAGTCTA TACACATATA 

AGTGCAGAGC ^TGTAAATTA AAXATTCTGA TACTTTTTCC TCAATAAATA TTTAAAT 
TCACGTCTGG AACATTTAAT TTATAAfiACT ATGAAAAAGG AGMATTTAT AAAXTTA 
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AAGCCTGGGA CCATGGTCTG CT6CGGCCC6 GGACGGATGC TGCTAGGATG GGCCGGGTTG €0 
TTCGGACCCT GGTACCAGAC GACGCCGGGC CCT6CCTACG ACGATCCTAC CCGGCCCAAC 

CTAGTCCTGG CTGCTCTCTG CCTGCTCCAG GTGCCCGGAG CTCAGGCTGC AGCCTGTGAG 120 
GATCAGGACC GACGAGAGAC GGACGAGGTC CAOGGGCCTC GAGTCCGACG TOGGACACTC 

CCTGTCCGCA TCCCGCTGT6 CAAGTCCCTT CCCTGGAACA TGACCAAGAT GCCCAACCAC 180 
GGACAGGCGT AGGGCGACAC GTTCAGGGAA GGGACCTTGT ACTGGTTCTA CGGOTTGGTG 

CTGCACCACA GCACCCAGGC TAACGCCATC CTGGCCATGG AACA6TTCGA AGGGCTGCTG 240 
GACGTGGTGT CGTGGGTCCG ATTGCGGTAG GACCGGTACC TTGTCAAGCT TCCCGAOGAC 

GGCACCCACT GCAGCCCGGA TCTTCTCTTC TTCCTCTGTG CAATGTACGC ACCCATTTGC 300 
CCGTGGGTGA CCTCGGGCCT AGAAGAGAAG AAGGAGACAC GTTACATGCG TGGGTAAACG 

ACCATCGACT TCCAGCACGA GCCCATCAAG CCCTGCAACT CTGTGTGTCA GCGCGCCOGA 360 
TGGTAGCTGA AGGTCGTGCT OGGGTAGTTC GGGACGTTCA GACACACACT CGCGCGGGCT 

CAGGGCTGOG AGCCCATTCT CATCAAGTAC CXXXACTCGT GGCCGGAAAG CTTGGCCTGC 420 
GTCCCGACGC TOGGGTAAGA GTAGTTCATG GCGGTGAGCA CCGGCCTTTC GAAOCGGAOG 

GAOGAGCTGC CGGTGTACGA COXX3GCGTG TGCATCTCTC CTGAGGCCAT CGTCACCGCG 480 
CTGCTCGACG GCCACATGCT GGCGCCGCAC AOGTAGAGAG GACTCCGGTA GCAGTGGCGC 

GACGGAGOGG ATTTTCCTAT GGATTCAAGT ACTGGACACT GCAGAGGGGC AAGCAGCGAA 540 
CTGOCTOGCC TAAAAGGATA CCTAAGTTCA TGACCTGTGA CGTCTCCCCG TTCGTCGCTT 

CGTTGCAAAT GTAAGCCTGT CAGAGCTACA CAGAAGACCT ATTTCCGGAA CAATTACAAC 600 
GCAACGTTTA CATTCGGACA GTCTGGATGT GTCTTCTGGA TAAAGGCCTT GTTAATGTTG 

TATGTCATCC GGGCTAAAGT TAAAGAGGTA AAGATGAAAT GTCATGATGT GACCGCCGTT 660 
ATACAGTAGG CCCGATTTCA ATTTCTCCAT TTCTACTTTA CAGTACTACA CTGGCGGGAA 

GTGGAAGTGA AGGAAATTCT AAAGGCATCA CTGGTAAACA TTCCAAGGGA CACCGTCAAT 720 
GAGCTTCACT TCCTTTAAGA TTTCCGTAGT GACCATTTGT AAGGTTCCCT GTGGCAGTTA 

CTTTATACCA CCTCTGGCTG CCTCTGTCCT COtCTTACTG TCAATGAGGA ATATGTCATC 780 
GAAATATGGT GGAGACCGAC GGAGACAGGA GGTGAATGAC AGTTACTCCT TATACAGTAG 

ATGGGCTATG AAGACGAGGA ACGTTCCAGG TTACTCTTGG TAGAAGGCTC TATAGCTGAG 840 
TACCCGATAC TTCTGCTCCT TGCAAGGTCC AATGAGAACC ATCTTCCGAG ATATCGACTC 

AAGTGGAAGG ATCGGCTTGG TAAGAAAGTC AAGCGCTGGG ATATGAAACT OCGACACCTT 900 
TTCACCTTCC TAGCCGAACC ATTCTTTCAG TTCGOGACCC TATACTTTGA GGCTGTGGAA 

GGACTGGGTA AAACTGATGC TAGCGATTCC ACTCAGAATC AGAAGTCTGG CAGGAACTCT 960 
CCTGACCCAT TTTGACTACG ATCGCTAAGG TGAGTCTTAG TCTTCAGACC GTCCTTGAGA 
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AATCCCCGGC CAGCACGCAG CTAAATCCTG AAATGTAAAA GGCCACACCC ACGGACTCCC 1020 
TTAGGGGCC6 GTCGTGOGTC GATTTAGGAC TTTACATTTT CCGGTGTGGG TGCCTGAGGG 

TTCTAAGACT GGCGCTGGTG GACTAACAAA GGAAAACCGC ACAGTTGTGC TCGTGACCGA 1080 
AAGATTCTGA CCGCGACCAC CTGATTGTTT CCTTTTG6CG TGTCAACACG AGCACTGGCT 

TTGTTTACCG CAGACACCGC GTGGCTACCG AAGTTACTTC OGGTCCCCTT TCTCCTGCTT 1140 
AACAAATGGC GTCTGTGGCG CACCGATGGC TTCAATGAAG GCCAGGGGAA AGAGGACGAA 

CTTAATGGCG TGGGGTTAGA TCCTTTAATA TGTTATATAT TCTGTTTCAT CAATCACGTG 1200 
GAATTACCGC ACCCCAATCT AGGAAATTAT ACAATATATA AGACAAAGTA GTTAGTGCAC 

GGGACTGTTC TTTTGCAACC AGAATAGTAA ATTAAATATG TTGATGCTAA GGTTTCTGTA 1260 
CCCTGACAAG AAAACGTTGG TCTTATCATT TAATTTATAC AACTACGATT CCAAAGACAT 

CTGGACTCCC TGGGTTTAAT TTGGTGTTCT GTACOCTGAT TGAGAATGCA ATGTTTCATG 1320 
GACCTGAGGG ACCCAAATTA AACCACAAGA CATGGGACTA ACTCTTACGT TACAAAGTAC 

TAAAGAGAGA ATCCTGGTCA TATCTCAAGA ACTAGATATT GCTGTAAGAC AGCCTCTGCT 1380 
ATTTCTCTCT TAGGACCAGT ATAGAGTTCT TGATCTATAA CGACATTCTG TCGGAGACGA 

GCTGCGCTTA TAGTCTTGTG TTTGTATGCC TTTGTCCATT TCCCTCATCC TGTGAAAGTT 1440 
CGACGCGAAT ATCAGAACAC AAAGATACGG AAAGAGGTAA AGGGAGTACG ACACTTTCAA 

ATACATGTTT ATAAAGGTAG AAOGGCATTT TGAAATCAGA CACTGCACAA GCAGAGTAGC 1500 
TATGTACAAA TATTTCCATC TTGCCGTAAA ACTTTAGTCT GTGACGTGTT CGTCTCATCG 

(XAACACCAG GAAGCATTTA TGAGGAAACG CCACACAGCA TGACTTATTT TCAAGATTGG 1560 
GGTTGTGGTC CTTCGTAAAT ACTCCTTTGC GGTGTGTCGT ACTGAATAAA AGTTCTAACC 

CAGGCAGCAA AATAAATAGT GTTGGGAGCC AAGAAAAGAA TATTTTGCCT GGTTAAGGGG 1620 

mxunxxm ttatttatca caaccctcgg ttcttttctt ataaaacgga ccaattcccc 

CACACTGGAA TCAGTAGCCC TTGAGCCATT AACAGCAGTG TTCTTCTGGC AAGTTTTTGA 1680 
GTGTGACCTT AGTCATCGGG AACTOGGTEAA TTGTOGTCAC AAOAAGACCG TTCAAAAACT 

TTTGTTCATA AATGTATTCA CGAGGATTAG AGATGAACTT ATAACTAGAC ATCTGTTGTT 1740 
AAACAAGTAT TTACATAAGT GCTCGTAATC TCTACTTGAA TATTGATCTG TAGACAACAA 

ATCTCTATAG CTCTGCTTCC TTCTAAATCA AACCCATTGT TGGATGCTCC CTCTCCATTC 1800 
TAGAGATATC GAGACGAAGG AAGATTTAGT TTGGGTAACA ACCTACGAGG GAGAGGTAAG 
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ATAAATAAAT TTGGCTTGCT GTATTGGCCA GGAAAAGAAA GTAITTAAAGT ATGCATGCAT I860 
TATTTATTTA AACCGAACGA CATAACCGGT CCTTTTCTTT CATAATTTCA TACGTACGTA 

GTGCACCAGG GTGTTATTTA ACAGAGGTAT GTAACTCTAT AAAAGACTAT AATTTACAGG 1920 
CACGTGGTCC CACAATAAAT TGTCTCCATA CATTGAGATA TTTTCTGATA TTAAATGTCC 

ACACGGAAAT GTGCACATTT GTTTACTTTT TrTCT T CC TT TTGCTTTGGG CTTGTGATTT 1980 
TGTGCCTTTA CACGTGTAAA CAAATGAAAA AAAGAAGGAA AACGAAACCC GAACACTAAA 

TGGTTTTTGG TGTGTTTATG TCTGTATTTT GGGGGGTGGG TAGGTTTAAG CCATTGCACA 2040 
ACCAAAAACC ACACAAATAC AGACATAAAA CCCCCCACCC ATCCAAATTC GCTAACGTGT 

TTCAAGTTGA ACTAGATTAG AGTAGACTAG GCTCATTGGC CTAGACATTA TGATTTGAAT 2100 
AAGTTCAACT TGATCTAATC TCATCTGATC CGAGTAACCG GATCTGTAAT ACTAAACTTA 

TTGTGTTGTT TAATGCTCCA TCAAGATGTC TAATAAAAGG AATATGGTTG TCAACAGAGA 2160 
AACACAACAA ATTACGAGGT AGTTCTACAG ATTATTTTCC TTATACCAAC AGTTGTCTCT 

CGACAACAAC AACAAA 
GCTGTTGTTG TTGTTT 
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MVCGSPGGML LLRAaTiLAIA ALCLLKVPGA RAAACEPVRI PLCKSLPWNM TKMPNHLHHS 60 

TQANATTiATR QFEGLLGTHC SPDLLFFLCA MYAPICTIDF QHBPIKPCKS VCERARQGCE 120 

PILIKYRHSW PENLACEELP VYDRGVCISP EMVTADGAD FPMDSSNQNC RGASSERCKC 180 

KPIRATQKTY FRNNYNYVTR AKVKEIKTKC HDVTAWEVK EILKSSLVNI PRDTVNLYTS 240 

SGCLCPPLNV NEEYHMQYE DEERSRLLLV EGSIAEKWKD RLGKKVKRHD MKLRHLGLSK 300 
SDSSNSDSTQ SQKSGRNSNP RQARN. 



Figure 9 

SUBSTITUTE SHEET (RULE 26) 



WO 97/48275 PCT/US97/10942 

17/18 



GGCGGAGCGG GCCTTTTGGC GTCCACTGCG CGGCTGCACC CTGCCCCATC TCCCGGGATC 60 
CCGOCTCGCC CGGAAAACGG CAGGTGACGC GCCGACGTG6 GACGGGGTAG ACGGCCCTAG 

ATGGTCTGCG GCAGCCCGGG AGGGATGCTG CTGCTGCGGG CCGGGCTGCT TGCCCTGGCT 120 
TACCAGACGC CGTCGGGCCC TCCCTACGAC GACGACGCOC GGCOCGAOGA ACGGGACCGA 

GCTCTCTGCC TGCTCCGGGT GCCCGGGGCT CGG6CT6CAG CCTGTGAGCC CGTCCGCATC 180 
CGAGAGACGG ACGAGGCCCA OGGGCCCCGA GOCCGACGTC GGACACTCGG GGAGGCGTAG 

CCCCTGTGCA AGTCCCTGCC CTGGAACATG ACTAAGATGC CCAACCACCT GCACCACAOC 240 
GGGQACACGT TCAGGGACGG GACCTTCTAC TGATTCTACG GGTTGGTGGA CGTGGTGTCG 

ACTCAGGCCA ACGCCATCCT GGCCATCGAG CAGTTCGAAG GTCTGCTGGG CACOCACTGC 300 
TGAGTOCGCT TGCGGTAGGA CCGGTAGCTC GTCAAGCTTC CAGACGACCC GTGGGTGACG 

AGCCCCGATC TGCTCTTCTT CCTCTGTGCC ATGTACGCGC CCATCTGCAC CATTGACTTC 360 
TCGGGGCTAG ACGAGAAGAA GGAGACAOGO TACATCCGCG GGTAGACGTG GTAACTGAAG 

CAGCACGAGC CCATCAAGCC CTCTAAGTCT GTGTGCGAGC GGGCCOGGGA OGGCTGTGAG 420 
GTCGTGCTCG GGTAGTTCGG GACATTCAGA CACACGCTCG CCCGGGCCQT OCOGACACTC 

CCCATACTCA TCAAGTACCG CCACTCGTGG CCGGAQAACC TGGCCTGOGA GGAGCTGCCA 480 
GGGTATCAOT AGTTCATGGC GGTGAGCACC GGCCTCTTGG ACCGGACGCT CCTCGACGGT 

GTGTACGACA GGGGCGTGTG CATCTCTCCC GAGGCCATCG TTACTGCGGA CGGAGCTGAT 540 
CACATGCTGT OCCOGCAGAC GTAGAGAGGG CTOCGGTAGC AATGACGCCT GCCTCGACTA 

TTTCCTATGG ATTCTAQTAA CGGAAACTGT AGAGGGQCAA GCAG7GAACG CTGTAAATGT 600 
AAAGGATACC TAAGATCATT GCCTTTQACA TCTCCCCGTT CGTCACTTGC GACATTTACA 

AAGCCTATTA GAOCTACACA GAAGACCTAT TTCOGGAACA ATTACAACTA TGTCATTCGG 660 
TTCGGATAAT CTCGATGTCT CTTCTGGATA AAGGCCTTGT TOATCTTGAT ACAGTAAGCC 

GCTAAAGTTA AAGAGATAAA GACTAAGTCC CATGATGTGA CTGCAGTAGT GGAGGTGAA6 720 
CGATTTCAAT TTCTCTATTT CTQATTCACG CTACTACACT GACGTCATCA CCTCCACTTC 

GAGATTCTAA AGTCCTCTCT GGTAAACATT CCACGGGACA CTGTCAACCT CTAXACCAGC 780 
CTCTAAGATT TCAGGAGAGA CCATTTGTAA GGTGCCCTGT GACAGTTGGA OATATGGTCG 

TCTGGCTGOC TCTGCCCTCC ACTTAATOTT AATGAGGAAT ATATCATCAT GGGCTATGAA 840 
AGACCGACGG AGACGGGAGG TGAATTACAA TTACTCCTTA TATAOTAGTA CCCGATACTT 
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GATGAGGAAC GTTCCAGATT ACTCTTGGTG GAAGGCTCTA TAGCTGAGAA GTGGAAGGAT 900 
CTACTCCTTG CAAGGTCTAA TGAGAACCAC CTTCCGAGAT ATCGACTCTT (^CCTTCCTA 

CGACTCGGTA AAAAAGTTAA GCGCTGGGAT ATGAAGCTTC GTCATCTTGG ACTCAGTAAA 960 
GCTGAGCCAT TTTTTCAATT CGCGACCCTA TACTTCGAAG CAGTAGAACC TGAGTCATTT 

AGTGATTCTA GCAATAGTGA TTCCACTCAG AGTCAGAAGT CTGGCAGGAA CTCGAACCCC 1020 
TCACTAAGAT CGTTATCACT AAGGTGAGTC TCAGTCTTCA GACCGTCCTT GAGCTTGGGG 

CGGCAAGCAC GCAACTAAAT CCCGAAATAC AAAAAGTAAC ACAGTGGACT TCCTATTAAG 1080 
GCCGTTCGTG CGTTGATTTA GGGCTTTATG TTTTTCATTG TGTCACCTGA AGGATAATTC 

ACTTACTTGC ATTGCTGGAC TAGCAAAGGA AAATTGCACT ATTGCACATC ATATTCTATT 1140 
TGAATGAACG TAACGACCTC ATCGTTTCCT TTTAACGTGA TAACGTGTAG TATAAGATAA 

GTTTACTATA AAAATCATGT GATAACTGAT TATTACTTCT GTTTCTCTTT TGGTTTCTCC 1200 
CAAATGATAT TTTTAGTACA CTATTGACTA ATAATGAAGA CAAAGAGAAA ACCAAAGACG 

TTCTCTCTTC TCTCAACCCC TTTGTAATGG TTTGGGGGCA GACTCTTAAG TATATTGTGA 1260 
AAGAGAGAAG AGAGTTGGGG AAACATTACC AAACCCCCGT CTGAGAATTC ATATAACACT 

GTTTTCTATT TCACTAATCA TGAGAAAAAC TGTTCTTTTG CAATAATAAT AAATTAAACA 1320 
CAAAAGATAA AGTGATTAGT ACTCTTTTTG ACAAGAAAAC GTTATTATTA TTTAATTTGT 

TGCTGTTACC AGAGCCTCTT TGCTGAGTCT CCAGATGTTA ATTTACTTTC TGCACCCCAA 1380 
ACGACAATGG TCTCGGAGAA ACGACTCAGA GGTCTACAAT TAAATGAAAG ACGTGGGGTT 

TTGGGAATGC AATATTGGAT GAAAAGAGAG GTTTCTGGTA TTCACAGAAA GCTAGATATG 1440 
AACCCTTACG TTATAACCTA CTTTTCTCTC CAAAGACCAT AAGTGTCTTT CX3ATCTATAC 

CCT TAAAAC A TACTCTGCCG ATCTAATTAC AGCCTTATTT TTGTATGCCT TTTGGGCATT 1500 
GGAATTTTGT ATGAGACGGC TAGATTAATG TCGGAATAAA AACATACGGA AAACCCGTAA 

CTCCTCMGC TTAGAAAGTT . CCAAATCTTT ATAAAGGTAA AATGGCAGTT TGAAGTCAAA 1560 
GAGGAGTACG AATCTTTCAA GGTTTACAAA TATTTCCATT TTACCGTCAA ACTTCAGTTT 

TGTCACATAG GCAAAGCAAT CAAGCACCAG GAAGTGTTTA TGAGGAAACA ACACCCAAGA 1620 
ACAGTGTATC OGTTTOGTTA GTTCGTGGTC CTTCACAAAT ACTCCTTTGT TO TCGG TTCT 

TGAATTATTT TTGAGACTGT CAGGAAGTAA AATAAATAGG AGCTTAAGAA AGAACATTTT 1680 
ACTTAATAAA AACTCTGACA GTCCTTCATT TTATTTATCC TCGAATTCTT TCTTGTAAAA 

GCCTGATTGA GAAGCACAAC TGAAACCAGT AGCCGCTGGG GTGTTAATGG TAGCATTCTT 1740 
CGGACTAACT CTTCGTGTTG ACTTTGGTCA TCGGOGACCC CACAATTACC ATCGTAAGAA 

CTTTTGGCAA TACATTTGAT TTGTTCATGA ATATATTAAT CAGCATTAGA GAAATGAATT 1800 
GAAAACCGTT ATGTAAACTA AACAAGTACT TATATAATTA GTCGTAATCT CTTTACTTAA 

ATAACTAGAC ATCTGCTGTT ATCACCATAG TTTTGTTTAA TTTGCTTCCT TTTAAATAAA 1860 
TATTGATCTG TAGACGACAA TAGTGGTATC AAAACAAATT AAACGAAGGA AAATTTATTT 

CCCATTGGTG AAAGTCAAAA AAAAAAAAAA AAA 
GGGTAACCAC TTTCAGTTTT TTTTTT T T TT TTT 
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